FIGURE l 



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGTCTGAAGACTAACATTTTGTGAAGTTGTAAAACAGAAAACCTGTTAGAAATGTGGTGGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 

TC ATAC AT T AC T GC AG T AAC AC T CCACCAT ATAGACCCGGCT T TACCT TAT AT CAG T GACAC T GG 
TACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGGCAGTTTTATGCATTG 

CTACCAT T T ATGT TCGT TATAAGCAAGT T CATGCTCTGAGT CCTGAAGAGAACGT TATC ATCAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
TAT AT AT GT T T GT TCAGACCAT CCT TT CC TACCAAAT GCAGCCCAAAAT CCAT GGCAAACAAG T C 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
ATCAGT T TT GCACAGT GGCAAT TT TGGGACT GAT T TAGAACAGAAACT CCAT TGGAACCCCGAGG 
ACAAAGGT T ATGT GC T T CACAT GAT CACT ACT GCAGCAGAAT GGTC TAT GT CAT TTTCCTTCTTT 
GGTTTTTTCCT GAC T TACAT T CGTGAT T T T CAGAAAAT T T CT T TACGGGT GGAAGC CAAT T TACA 
T GGAT TAACCC TC TAT GACACT GCACCT TGCC C TAT TAACAAT GAACGAACACGGC TACT T T CCA 
GAGATAT T TGAT GAAAGGAT AAAATAT T T CT GTAAT GAT TAT GAT T C T CAGGGAT TGGGGAAAGG 
T T CACAGAAG T TGC T TAT T CT T C TC TGAAAT T T TCAACCACTT AATCAAG GCT GACAGT AACAC T 
GAT GAAT GCT GATAATCAGGAAACAT GAAAGAAGCCAT T T GATAGAT TAT T CTAAAGGAT AT CAT 
CAAGAAGACTAT TAAAAACACC TATGCCTATAC T T T T T TATCT CAGAAAATAAAGT CAAAAGAC T 
ATG 
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FIGURE 2 



<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX(S/T): 0 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 

LC I AT I YVRYKQVHALS PEENVI IKLNKAGLVLG I LS CLGLS I VANFQKT TLFAAHVSGAVLT FG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGWLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 

LLSRDI 

Important features: 

Type II transmembrane domain: 

amino acids 13-33 

Other Transmembrane domains: 

amino acids 54-73, 94-113, 160-180, 122-141 

N-myristoylation sites. 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE 3 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CT T ACACT TCGCCATGAGT TT CCT C AT C GACT CCAGCATCATGAT TAC C TC CCAGAT AC T ATTT T TTG 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
|_, TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA 
P CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
C3 CATT CTCAGCCCAAAACAT GGGATC TTAT CCATAGAACAGCTCATCAGCCGGGTT GGTGTGATT GGAG 

O 1 TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
* CTCAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATATGAT 
'? I CATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAAGTGCATAACA 
mi AACCATCAGGT TT CT GGGGAATGAT AAAAAGT GTTACCACT TCAGCAT CAGGAAGTGAAAATCT TACT 

s CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCT 
0 ATATGCTACCAAGGAGAGAATAGAATACTCCAAAACCTTCAAGGGGAAATATTTTAATTTTCTTGGTT 
til ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
W GGGAAAACGGATCCTGTCACAAGAGGCATTGAGATCACTGTGAATTAT CTGGGAATCCAAT TTGATGT 

GAAGTTTTGGTCCCAACACATTTCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGAT 
|p TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 

CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
AGAA.TACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAATGGCACCTTGAACTTAAGCCTACTACAGACTGTTAGAGGCCAGTGGTTTCAAAATTTA 
GATATAAGAGGGGGGAAAAAT GGAACCAGGGCCT GACAT T TTATAAACAAACAAAATGCT ATGGTAGC 
ATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATACTATGACCATGAGTAGCATCAGCCAG 
AACAT GAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCATCC CGTGTGGATATGAGGCTGG 
TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 
CTATGGTAGCTGAGCCAAACACGTAGGATTTCCGTTTTAAGGTTCACATGGAAAAGGTTATAGCTTTG 
CCTT GAGATTGACT CATTAAAAT CAGAGACT GTAACAAAAAAAAAAAAAAAAAAAkAGGGCGGCCGCG 
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE 4 



MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYWQVIFSVTFAFSCTMFELIIFEILGV 
LNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTLMM.LSGFGAVNCPYTYMSYFLROTTDTDILALERRLLQ 
TMDM 1 1 S KKKRMMARR TMFQKGE VHNKP S G FWGM I KS VT T S AS GS ENL T LI QQEVDALE E L SRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI 
TVNYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY 
H' FVS S VLLI RMSMPLE YRT 1 1 TEVLGELQ FNFYHRW FDV I FLVSAL S S I L FL YLAHKQAPEKQMAP 

0 
o 

^ Important features: 

yi 

hi Signal peptide: 

Iff amino acids 1-23 



5 ^ Potential transmembrane domains: 

P % amino acids 37-55, .81-102, 150-168, 288-311, 338-356, 375-398, 

m 425-444 

CI 

III N-glycosylation sites. 

^ amino acids 67-70, 180-183 and 243-246 

rll 



Eukaryotic cobalamin -binding proteins 

amino acids 151-160 
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FIGURE 5 

AGCAGGGAAATC CGGAT GT CTCGGT TAT GAAGTGGAGCAGT GAGT GTGAG CC T CAACATAGT TCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GT CATGCAGAAT TATGGGGAT CACCCT T GT GAGCAAAAAGGCGAACCAGCAGC TGAATT T CACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGT T TGCAGCC T AT T GT TACAACT CATC TGAT ACT T GGAC T AACTCGT GCAT T CCAGAA 
AT TAT CACCACCAAAGATCCCATAT TCAACACTCAAACT GCAACACAAAC AACAGAATT T AT T GT 
CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
CTGC TC CAGCT TCCACTT CTAT TC CACGGAGAAAAAAAT T GAT T TGT GT CACAGAAGTT T T TAT G 
GAAACTAGCACCATGT CTACAGAAACTGAACCAT TTGT TGAAAATAAAGCAGCATTCAAGAATGA 
AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
CAGCAGAAGGAAAT GAT CGAAACCAAAGT AGTAAAGGAGGAGAAGGCCAAT GATAGCAACC CTAA 
T GAGGAAT CAAAGAAAAC TGATAAAAACCCAGAAGAGT CCAAGAG T C CAAGCAAAAC TACCGT GC 
GATGCCTGGAAGCTGAAGTTTAGATGAGACAGAAATGAGGAGACACACCTGAGGCTGGTT T CTTT 
CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 

CCCTTGGTTCC TAACTGGAAT CAGC TCAGGACT GCCAT T GGAC T AT GGAG T GC ACCAAAGAGAAT 
GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
TTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC 
CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 

AAAGCCAAGGAGTCACT GAGACCAAGGCT T T C TCTACT GAT T CCGCAGC TCAGACCC T TT C T TCA 
GCTCTGAAAGAGAAACACGTATCCCACCTGACATGTCCTTCTGAGCCCGGTAAGAGCAAAAGAAT 
GGCAGAAAAGTTTAGCCCCTGAAAGCCATGGAGATTCTCATAACTTGAGACCTAATCTCTGTAAA 
GC TAAAATAAAGAAATAGAACAAGGCTGAGGATACGACAGTACACT GT CAGCAGGGACTGTAAAC 
ACAGACAGGGTCAAAGTGTTT TCTC TGAACACAT TGAGTT GGAATCACTGT TTAGAACACACACA 
CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTAACA 
AAAAT AAAAAC T C T T AT AAAT T T C TAT T T T TAT C T GAG T T AC AGAAAT GAT T AC T AAG G AAG AT T 
AC T CAGT AAT TTGT TT AAAAAGTAATAAAAT TCAACAAACAT T T GC T GAATAGC TAC TAT AT GT C 
AAGT GCT GT GCAAGGT AT TACACTCT GTAAT TGAATATT AT TCC TCAAAAAAT T GCACATAGTAG 
AAC GC TAT C T GGGAAG CT AT t T T T T T CAG T T T T GAT AT T T C T AGC T TAT C TAC T T CCAAAC T AAT 
TTTTATTTTTGCTGAGACTAATCTTATTCATTTTCTCTAATATGGCAACCATTATAACCTTAATT 
TAT TATTAACATACCTAAGAAGTACAT TGTTACCTCTATATACCAAAGCACAT TT TAAAAGTGCC 
AT T AACAAATGTATCACTAGCCCTCC T TT TTCCAAGAAGAAGGGAC T GAGAGATGCAGAAATAT T 
T GTGACAAAAAATTAAAGCAT T TAGAAAACT T 



FIGURE 6 



MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
L S LAGKDQVE TALKAS FETC S YGWVGDG FWI SR I S PNPKCGKNGVGVL I WKVPVS RQFAAYC YN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKL I CVTEVFME TS TMS TETE P FVENKAAFKNEAAGFGGVPTALLVLALL FFGAAAGLG FC YVK 
RYVKAFPFTNKNQQKEMIETKVVKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 

Signal sequence: 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 

Tyrosine kinase phosphorylation site. 

amino acids 79-88 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 

GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 

CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 

CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 

GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 

GGAGGACAC GCAGCACAAAT T GCGCAGCGCGGTGGAAGAGAT GGAGGCAGAAGAAGC TGC TGCTA 

AAGCATCATCAGAAGTGAACCTGGCAAACTTACCTCCCAGCTATCACAATGAGACCAACACAGAC 

ACGAAGGTTGGAAATAATACCATCCATGTGCACCGAGAAATTCACAAGATAACCAACAACCAGAC 

TGGACAAAT GGTCT T T T CAGAGACAGT TATCACAT C T GT GGGAGACGAAGAAGGCAGAAGGAG CC 

ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 

ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 

GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 

ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 

ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 

CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 

CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 

GAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 

C CAGGAGCT GGAGGACC TGGAGAGGAGCC TGAC TGAAGAGATGGCGCT GGGGGAGCC T GC GGC TG 

CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGA.TCTGGACCAGGCTGTGGGTAGATGTGCAA 

TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 

CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 

CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 

GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 

TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 

TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 

CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 

AGAT GAAATG T T C T G T TC ACCC TGCAT T ACAT GTGT T TAT TCAT CC AGCAG T GT TGC T CAGC T CC 

TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 

AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 

CAAGT CACACAGCTAGTGAAGAC CAGAGCAGT T TCAT CT GGT TGTGACT C T AAGC TCAGT GC TC T 

CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 

TTTTCTTGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAA 

CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 

AT TGACACT GTCCCT CTT T GGCAGT TGCAT TAGTAACT T T GAAAGGT AT AT GAC TGAGCGT AGCA 

TACAGGT TAACCTGCAGAAACAGTACT T AGGTAAT TGTAGGGC GAGGAT TATAAATGAAAT T T GC 

AAAATCACT TAGCAGCAACTGAAGACAAT TAT CAACCACGT GGAGAAAATCAAACCGAGCAGGGC 

T G T GT GAAACATGGT T GTAATAT GCGACT GCGAACAC TGAACT C TACGCCACTCCACAAATGAT G 

TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 

CATGATT GTATAAGCATGCT TTCTTTGAGT TT TAAAT TATGTATAAACATAAGTTGCATTTAGAA 

AT CAAGCATAAATCAC T T CAAC TGCAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 8 



MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 

RSAWEMEAEEAAAKASSEWLANLPPSYHNETNT IHKITNNQTGQMVFSE 
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE 

RS LTEEMALGE PAAAAAALLGGEE I 

Signal sequence: 

amino acids 1-19 

N- glycosylate on site. 

amino acids 96-100, 106-110, 121-125, 204-208 
Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 
327-331 

N-myristoylation site. 

amino acids 202-208, 217-223 

Amidation site. 

amino acids 140-144 
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FIGURE Q 



CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 

GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 

GCCTAATGCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 

AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 

CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 

ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 

GAGTATATTGACCACAAGCTCAGAGGAGT GCAGAACCCTGTAGCCCGCT GCATCATGTGCTGTT T 

CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 

TCGC CAT CT ACGGGAAGAAT TTCTGTGTCT CAGCCAAAAATGCGT TCATGCT AC TCAT GCGAAAC 

ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 

GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 

ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 

GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 

GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 

AGAT T C T GGGC AAGAAGAACGAGGC GC C C C C GGACAAC AAGAAGAGGAAGAAG TGACAGC TCCGG 

CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 

CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 

TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 

AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 

GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 

GATCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAAA 

AAGAT T T TAT T AAAGATAT T T T G T T AACT C 
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FIGURE 10 



RTRGRTRGGCEKVP INT S CNPTAHLVNS SCPGLMC VFQG YS S KGL I QRS VFNLQ I YGVLGLFWTL 
NWIALGQCVLAGAFASFYWAFHKPQDIPTFPLISAFIRTLRYHTGSLAFGALILTLVQIARVIL 

EYI DHKLRGVQNPVARCIMCCFKCCLWCLEKFI KFLNRNAY IMI AI YGKNFCVS AKNAFMLLMRN 
IVRVWLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features: 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 

N-glycosylation sites. 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBR002c family proteins. 

amino acids 276-288 

Ammonium transporters proteins . 

amino acids 204-231 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 
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FIGURE 11 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
S ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
p CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
§! GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
Ul TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 
W CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 
CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 

s 

?1 G GGC AAC G AGACAGT T G TGGCAG GC CC CGAGGG.C TAT GAGAC C C AGT GGTGG GAT GC CC CGAGC AT T G 

|j| TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
Q AACAGC C T GAT G CAG AC CGAG GAGT GC C CAC C T ATG C T AGACG C C AC ACAG C AG C AG C AG C AGCAGGT 

fU GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 
O ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 
iy ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 

CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
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FIGURE 12 



MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 
YKLPWVCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 
NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVSIAAVLPKV 
QDAQPNSGLLQASVITLYTMFYTWSALSSIPEQKCNPHLPTQLGNETWAGPEGYETQWWDAPSI 
VGL 1 1 FLLCT L FI S LRS S DHRQVNS LMQTEEC PPMLDAT QQQQQQVAACEGRAFDNEQDGVT YS Y 
S FFHFCLVLASLHVMMTLTNWYKPGETRKMI S TWTAVWVKICASWAGLLLYLWTLVAPLLLRNRD 
FS 

Signal sequence: 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE 13 



CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGT TGGAAAAAGAC TCC TGTAACC C TCC T CCAGGATGAACCACC TGCCAGAAGACAT GGAGAAC G 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
CTCATGGCCAGGATTGAGTCCTATGAAGGAAGGGAAAAGAAAGGCATATCTGATGTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 

T GAAT GGAGGC AT T GAGAACAC AT T AGAG AAGGAGGT GAT GCAGTAT GACTAC TAT T CT T CAT AT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
CT TGCCT GGAT TGAGACGTGGTTCC T GGATT T CAAAGT GT T ACC TCAAGAAGCAGAAGAAGAAAA 
CAGACTCCT GATAGTTCAGGATGCTTCAGAGAGGGCAGCACT T ATACCT GGTGGTCT T TCTGATG 
GTCAGTTTTATTCCCCTCCTGAATCCGAAGCAGGATCTGAAGAAGCTGAAGAAAAACAGGACAGT 
GAGAAACCACTTTTAGAACTATGAGTACTACT T TTGT TAAATGTGAAAAACCCTCACAGAAAGTC 
ATCGAGGCAAAAAGAGGCAGGCAGT GGAG TCTC CC TG TC GACAGTAAAGTT GAAATGGTGACGT C 
CAC TGCT GGCT T TAT TGAACAGCTAAT AAAGAT T TAT T TAT T GTAATACC T CACAAACGT T GTAC 
CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
T CAG TGAGACT GAGC CT GAT GT GT T AACAAATAGGT GAAGAAAGT CT T G T GCTGTAT T CC T AAT C 
AAAAGAC T T AATATAT T GAAGTAACAC T T T T T T AGTAAGCAAGAT ACC T T T T TAT TT CAAT T CAC 
AGAATGGAAT TT TT TT GT TT CATGTCT CAGAT TTAT T T TGTATT TCT T T T T TAACAC TCTACAT T 
TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
C T T GC AAAAGGGGAAGAAAGGAAT T GCGAAT AC AT G T AAAAT GTCACCAGACAT T TGTAT TAT T T 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAATGACTTAAACCAT TCATATCATGT TTCCTT TGCGTTCAGCCAAT TTCAAT TAAAAT GAA 

CTAAATTAAAAA 



FIGURE 14 



MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVT FDLLF 
VT LLW HE LNVNGGI ENTLEKEVMQYDYYS S YFDI FLLAVFRFKVL I LAYAVCRLRHWWAIALT T 
AVTSAFLLAKVILSKLFSQGAFGWLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AAL I PGGLSDGQFYS PPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

Q amino acids 1-20 

1 

■*fl Transmembrane domains: 

:J! amino acids 54-72, 100-118, 130-144, 146-166 

jfy N-irtyristoylation sites. 

* amino acids 14-20, 78-84, 79-85, 202-208, 217-223 

Q 
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FIGURE 15 

ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
f** CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
0 GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
O TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
m CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
HI TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
:t TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
in TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
ill CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
f| ! CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 

TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
*. CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
W ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
Iff GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG 
fl GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 

AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
;!f GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
O GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
fit GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 

CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 
GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCGCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
T C CAG G CG GAC CCTGGGGGC CAG T GAAG GAAGC T C C C GGAAAGAGC AGAG G GAGAG C GG G T AG GC 
GGC TGT GT GACTCTAGT CT T GGCCCCAGGAAGCGAAGGAACAAAAGAAACT GGAAAGGAAGAT GC 
T T TAGGAACATGT T TTGCTTTTT TAAAAT AT ATAT AT T T AT AAGAGATCCT T TCC CATT TAT T CT 
GGGAAGAT GT T T T TCAAAC T CAGAGACAAGGACT T TGGT TT T TGTAAGACAAACGATGATATGAA 
GGCCTTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA 
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FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTANVE 
ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCVCPLSWFGP1WRESHVTLASPEE 
TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSIiAPTWLSPTAPATEAP 
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLLIAPALAAV 
LIAALAAVGAAYCVRRGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGSE 
CEVPLMGFPGPGLQSPLHAKPYI 

Important features: 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature. 

amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 

GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATGCG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
AT GAAGAAGGCAGCCAGGAT GAAT CC TT AGAT T CCAAGAC TAC T TT GACAT CAGATGAGT CAGT A 
AAGGACCATACTAC TGCAGGCAGAG TAG T TGC TGG T CAAATAT T TCT T GAT TCAGAAGAAT C T GA 
AT TAGAAT CC TCTAT T CAAGAAGAGGAAGACAGCC T CAAGAGCCAAGAGGGGGAAAGTGT CACAG 
AAGATATCAGCTT TCTAGAGTCTCCAAAT CCAGAAAACAAGGACTATGAAGAGCCAAAGAAAGTA 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
T T T C C T AGATAAGGAG TAT GAT GAAT G TAC AT C AG AT G G GAG G GAAGAT G GC AGAC TGTGGTGTG 
CTACAACCTATGACTACAAAGCAGATGAAAAGTGGGGCTTTTGTGAAACTGAAGAAGAGGCTGCT 
AAGAGACGGCAGAT GCAGGAAGCAGAAATGATG T AT CAAACT GGAAT GAAAATCCT TAAT GGAAG 
CAATAAGAAAAGCCAAAAAAGAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGAACCATA 
CCAAAGCCCT GGAGAGAGT GT CATAT GCTCT TT TAT T T GGT GAT TACTT GC CACAGAATATCCAG 
GCAGCGAGAGAGATGTTTGAGAAGCTGACTGAGGAAGGCTCTCCCAAGGGACAGACTGCTCTTGG 
CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
TTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTT TAG TGGAAGGCT 
AAT AAT AT TAAC ATCAGAAGAAT TTGTGGTT T AT AGCGGCCACAAC T T T T T CAGC T T T CAT GATC 
CAGATTTGCTTGTATTAAGACCAAATATTCAGTTGAACTTCCTTCAAATTCTTGTTAATGGATAT 
AACACAT GGAATC TACAT GTAAATGAAAG T T GG TGGAGT C CACAATT T T T C T T TAAAAT GAT TAG 
TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 
GAATTGTCAGAATCATTTTTTACATTAGATTATCATAATTTTAAAAATTTTTCTTTAGTTTTTCA 
AAAT T T T GTAAAT GGT GGCTATAGAAAAAC AACATGAAATAT TATACAAT AT T T TGCAACAAT GC 
CCTAAGAAT TGT TAAAAT TCATGGAGTTAT TTGTGCAGAATGACTCCAGAGAGC TCTACTT TCTG 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGAT TAAC T CAT T T T T AAT AAAAT T AT GT C TAAGAT T AAAAA?J^W^AAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 18 



MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAEmYQTGMKILNGSNKKSQKR 
EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 

S SQAKALVY YT FGALGGNL I AHMVLVSRL 



Important features: 
H Signal peptide: 

amino acids 1-21 

PI 



m 

rii 



? n 



N-glycosylation sites . 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 



«j N-myristoylation sites. 

In amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 



Q 

III Glycosaminoglycan attachment site. 

amino acids 267-271 



Microbodies C-terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen-binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE 1Q 



AAT T CAGAT T T TAAGCCCAT T C TGCAGT GGAAT T T CAT GAAC TAGCAAGAGGACAC CAT C T T CT T 
GTATTATACAAGAAAGGAGTGTACCTAT CACACACAGGGGGAAAAATGCT CT TT TGGGTGCTAGG 
C C T CC T AAT CCTCTGTGGTTT TC T GTGGACT CGTAAAGGAAAAC T AAAGAT T GAAGACAT CAC T G 
ATAAGTACAT TTT TATCACTGGATGTGACTCGGGCT TTGGAAACTTGGCAGCCAGAACTT T TGAT 
AAAAAG G GAT T T CAT G T AAT CGCTGCCTGTCT GAC T GAAT CAGGAT C AAC AGC T T T AAAG G C AGA 
AACCTCAGAGAGACTTCGTACTGTGCTTCTGGATGTGACCGACCCAGAGAATGTCAAGAGGACTG 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
CCCGGCGTGCTGGCTCCCACTGACTGGCTGACACTAGAGGACTACAGAGAACCTATTGAAGTGAA 
C C TGTT T GGAC TCATCAGTGTGACAC TAAATATGCT TCCT TT GGTCAAGAAAGC T CAAGGGAGAG 
TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
TATGCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTTTGGTGTGCACGTCTC 
ATGCAT TGAACCAGGATT GT T CAAAACAAACT T GGCAGAT CCAGTAAAGGT AAT T GAAAAAAAAC 
T CGC CAT T T GGGAGC AGC TG T C T CCAGACATC AAACAACAAT AT GGAGAAGG T T AC AT T GAAAAA 
AGTCTAGACAAACTGAAAGGCAATAAATCCTATGTGAACATGGACCTCTCTCCGGTGGTAGAGTG 
CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 

TTT TC T GGATACC T CT GT C T C ACAT GCCAGCAGC TTT GCAAGAC T T T T TAT TGTT GAAAC AG AAA 
GCAGAGCTGGCTAATCCCAAGGCAGTGTGACTCAGCTAACCACAAATGTCTCCTCCAGGCTATGA 
AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 
CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 
CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
AT CAT CT CT TAT C TAAAT AT TAAAAGAT AAG T CAAC CCAAAAAA?^^AAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 



FIGURE 20 



MLFWVLGLLI LCGFLWTRKGKLKIEDI TDKYI FI TGCDSGFGNLAART FDKKGFHVTAACLTESG 
STALKAETSERLRTVLLDVTDPEWKRTAQm^KNQVGEKGLWGLINNAGVPGVLAPTDWLTLEDY 

repiewlfglisvtlnmlplvkkaqgrvinvssvggrlaivgggytpskyavegfndslrrdmk 
afg^/hvsciepglfktniju)pvkviekkiaiweqlspdikqqygegyieksldklkgnksy™md 

LSPVVECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKAELANPKAV 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 

GlycosaminoglYoan attachment site. 

amino acids 39-42 

N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 



CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 

GG T GAAGC C AAGAACAG CAT T AC T GAT T C C CAAAT GGAT GAT G T T GAAG T T GT T T AT AC AAT T GA 
CATTCAGAAATATATT CCATGCTATCAGCT TT TTAGCT TTTATAAT TCTT CAGGCGAAGTAAATG 
AGCAAGCACTGAAGAAAATATTATCAAAT GT CAAAAAGAATGTGGTAGGTTGGTACAAAT TCCGT 
CGT CATTCAGATCAGAT CATGACGT T TAGAGAGAGGCTGC T T CACAAAAACT T GCAGGAGCAT T T 
T T CAAACCAAGACCT T GTTTTTCTGC TAT T AACACCAAGT AT AATAACAGAAAGCTGCT CTACTC 
H AT CGACT GGAACAT TCC T T AT ATAAACC T CAAAAAGGAC TTT T TCACAGGGTACCT T TAGT GGT T 

*~ GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
ffj TTT TAGCCGAGCAGTACAAACACACAGCT CTAAAT T T T T T GAAGAAGATGGATCCT TAAAGGAGG 

|y TACATAAGATAAATGAAAT GTATGC T T CAT T ACAAGAGGAAT TAAAGAGTATAT GCAAAAAAGT G 

If! GAAGACAGTGAACAAGCAGTAGATAAACTAGTAAAGGATGTAAACAGATTAAAACGAGAAATTGA 
W GAAAAGGAGAGGAGCACAGAT T CAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCC T CAGGAGA 

*** ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 

3 

M ATG TCTT TAAAAAATAGACATG T T T C T AAAAGTAGC T GT AAC TACAAC CACCAT CTCGATG T AGT 

Ifl AGACAAT C TGAC CT TAAT GGTAGAACACAC TGAC AT T CCT GAAGCTAGTCCAGC TAGTACACCAC 

O . AAAT CAT T AAGCATAAAGC C T TAGAC T T AGAT GAC AG AT G G CAAT T CAAGAGAT CTCGGTTGTTA 

rs £ 

S V GATACACAAGACAAACGAT C TAAAGCAAAT AC T GGTAGT AGT AAC CAAGATAAAGCATC CAAAAT 

O 

pi G AGC AG C C C AGAAAC AGAT GAAG AAAT T G AAAAGAT GAAG GGTTTTGGT G AAT AT T C AC GG T C T C 

C T ACAT T T TGAT CC T T T TAACC T TACAAGGAGAT TT T T T TAT TT GGCT GAT GGGTAAAGCCAAAC 
ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
T G T T T GC AGTAATACACAG ATAAC TC T TAG T GC AT T T AC T T CAC AAAG TAC T TT TTC AAAC AT CA 
GATGCT TT TAT T TCCAAACCT T T T T T T CAC CT TT CACTAAGT TGT TGAGGGGAAGGCT TACACAG 
ACACAT TCT TTAGAATTGGAAAAGTGAGACCAGGCACAGT GGCTCACACCT GTAATCCCAGCACT 
TAGGGAAGACAAGTCAGGAGGATTGATTGAAGCTAGGAGTTAGAGACCAGCCTGGGCAACGTATT 

GAGACCAT GT CTATTAAAAAAT AAAATGGAAAAGCAAGAAT AGCC T TAT TTT CAAAAT AT GGAAA 
GAAATTTATATGAAAATTTATCTGAGTCATTAAAATTCTCCTTAAGTGATACTTTTTTAGAAGTA 
CAT TAT GGCTAGAGT T GCCAGATAAAATGCT GGAT AT CAT GCAAT AAAT T T GCAAAACATCAT CT 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 



MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PCYQLFSFYNSSGEWEQALKKILSNWKbTWGWYKFRRHSDQIMTFRERLLHKNLQEHFSNQDL 
VFLLLTPS 1 1 TESCS THRLEHSLYKPQKGL FHRVPLWANLGMSEQLGYKTVSGSCMS TGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDWRLKREIEPCRRGA 
QIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSSCNYNHHLDWDNLTL 
MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEEIEKMKGFGEYSRSPTF 

[me 

!!:. Important features: 

fi\ Signal peptide: 

hi amino acids 1-19 

Iff 

S;I N-glycosylation sites. 

|'|| 

amino acids 75-79, 322-326 

111 N-myristoylation site. 

M amino acids 184-154 

»t Growth factor and cytokines receptors family. 

HI 

amino acids 134-150 
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FIGURE 23 



GGCACAGC CGCGCGGCGGAGGGCAGAGT CAGC CGAGCCGAGT C CAGCC GGAC GAGCGGAC CAGCGCAGGGCAGC CCAA 

GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 

GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 

GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 

CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 

TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 

GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 

AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 

AACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCCAGGGTTCTACCTGCTGCTCT 

CAAGAGAT GGAGGAGAAGTACAGCCT GCAAAGTAAAGAT GAT TTCAAAAGT GT GGT CAGC GAACAGT G CAAT CATTT G 

CAAGC T GT CTT T GCTT CACGT TACAAGAAGTTT GAT GAAT T CT T CAAAGAACT ACT TGAAAAT GCAGAGAAAT CC CT G 

AAT GAT AT GTT T GTGAAGACATAT GGCCATT TATAC AT GCAAAATT CT GAGCTATTTAAAGAT CT CT T CGTAGAGTT G 

AAACGTTACTACGTGGT GGGAAAT GT GAACCT GGAAGAAAT GCTAAAT GACTT CT GGGCT CGCCT CCT GGAGCGGAT G 

TTCCGC CT GGT GAACT CCCAGT ACCACT TTACAGAT GAGTAT CTGGAAT GT GT GAGCAAGTATACGGAGCAGCT GAAG 

CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 

TTAGCGGTTGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCCTGTTGAAG 

AT GAT CTACT GCT C C CACT GC CGGGGT CT C GTGACT GT GAAGC CAT GTTAC AACT ACT GCT CAAACAT CAT GAGAGGC 

T GTTT G GCCAACCAAGGGGAT CT C GATT TT GAAT GGAACAATT T CAT AGAT GC TAT GCT GATGGT GGCAGAGAGGCT A 

GAGGGT CCTT T CAACATT GAAT CGGT CAT GGAT CCCAT C GAT GTGAAGAT TTCTGATGCTATT AT GAACAT GCAGGAT 

AATAGTGTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACGAATTTCTCGT 

TCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACATCACCCCGAGGAACGCCCAACCACAGCAGCTGGCACT 

AGTTTGGACCGACTGGTTACTGATGTCAAGGAGAAACTGAAACAGGCCAAGAAATTCTGGTCCTCCCTTCCGAGCAAC 

GTTT GCAACGAT GAGAGGAT GGCT G CAGGAAACG GCAAT GAGGATGACT GTT GGAATGGGAAAGGCAAAAGCAG GTAC 

CT GTTT GCAGT GACAG GAAAT GGATTAGC CAAC CAGGGC AACAACC CAGAGGT CCAGGT T GACACCAGCAAACCAGAC 

ATACTGATCCTTCGTCAAATCAT GGCTCTTCGAGTGAT GAC CAGCAAGATGAAGAATGCATACAATGGGAACGACGTG 

GACTTCTTTGATATCAGTGATGAAAGTAGTGGAGAAGGAAGTGGAAGTGGCTGTGAGTATCAGCAGTGCCCTTCAGAG 

TTTGACTACAATGCCACTGACCATGCTGGGAAGAGTGCCAATGAGAAAGCCGACAGTGCTGGTGTCCGTCCTGGGGCA 

CAGGCCTACCTCCTCACTGTCTTCTGCATCTTGTTCCTGGTTATGCAGAGAGAGTGGAGATAATTCTCAAACTCTGAG 

AAAAAGT GTT CAT CAAAAAGT TAAAAGGCAC CAGT TAT CACTTTTCTAC CAT C CTAGT GACTTT GCTTTTTAAAT GAA 

T GGACAACAAT GTACAGT T TT T ACTAT GT GGCCACT GGTTTAAGAAGT GCT GACTT T GT TT T CT CATT CAGTT TT GGG 

AGGAAAAGGGAC T GTGCATTGAGT TGGTT CCT GCT C CCC CAAAC CAT GT T AAACGT GGCTAACAGTGTAGGTACAGAA 

CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTCATTTCGTTTGTGGGTT 

TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAGGGTCCCTTCTTGGCACGTAACATGTACGTATT 

T CTGAAATATTAAATAGCT GT ACAGAAGCAGGTTTT AT TT AT CAT GT T ATCTT ATTAAAAGAAAAAGCC CAAAAAGC 



FIGURE 24 

MARFGLPALLCTIAVLSAALIJ^LKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKICPQGST 
CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 
LYMQNSELFKDLFVELKRYYWGlSr^^ 

QLKPFGDVPRKLKLQVTRAFVAARTFAQGIAVAGDWSKVSVVNPTAQCTHALLKMIYCSHCRGL 
VTVKPCYNYCSNIMRGCLANQGDLDFEWNNFinAMLMVMRL^ 

MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
EKLKQAKKFWSSLPSWCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS 
KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 
KSANEKADS AGVRPGAQAYLL T VFC I L FLVMQREWR 

Important features: 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 515-524 

N-glycosylation site. 

amino acids 514-518 

Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 

CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
CTGAGTCAT CCCCAGGGAT CAGGAGCCTCCAGCAGGGAACCT TCCAT TAT AT TCTTCAAGCAACT 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCT TCTAGGAGAT GGCT CCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGAT TGGTT CCTGAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
TCAAGGGCAAT GT GAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCAT T C CAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 
AGC GCCCAC T C T T CCAAT TAAACAT T CT CAGCCAAGAAGACAGT GAGC AC ACC T ACC AGAC AC T C 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGCTAGTG TC AT T TAACC T TAAATGCAAT CAGGAAAGTAGCAAACAGAAGT CAATAAAT AT TT T T 
AAAT GT CAAAAAAAAAAAAAAAAAA 
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FIGURE 26 

MKVLISSLLLLLPI^LMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 



Important features: 
Signal peptide: 

amino acids 1-22 

H 

P| N-rayristoylation sites. 

Q amino acids 27-33, 46-52 

s 

w 
in 
iii 

Fll 



CI 
ill 
O 
III 
P 
fll 
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FIGURE 27 



GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 

AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCT.GCC 

CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTAGCTGA 

TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGCTCTTCAGCCAG 

ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 

CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 

AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 

GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 

CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 

TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 

GAGCCCACCTGGAAACACATTGGGGATGGCTGCTGCCTCACCAGAGAGACCTGGAAGGATCTTGA 

GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 

GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 

T TCCCCAGCCTCCAATTAGAACAAGCCACCCACCAGCCTAT CTATCT T CCACTGAGAGGGACCTA 

GCAGAATGAGAGAAGACATTCATGTACCACCTACTAGTCCCTCTCTCCCCTiACCTCTGCCAGGGC 

AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 

AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 

CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 

CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 

CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 

ATGCCAGAGCAAGAC T CAAAGAGGCAGAGGT T T T GT T CTC AAAT AT T T T T TAATAAATAGACGAA 

ACCACG 



FIGURE 28 



MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 
RQLADGSMDVWCTLVLCSVQSPRKVLQEWRVLRPGGVLFFWEHVAEPYGSWAFMWQQVFEPTW 
KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGPCAVKQSFPSSKALICSFPSL 
QLEQATHQPIYLPLRGT 

Important features: 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern. 

amino acids 10-32 

N-myristoylation sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 

CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCTTATTTAAA 
GTGGTTGTTTATGATTCTTATACTAATTTATACAAAGATATTAAGGCCCTGTTCATTAAGAAATT 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
T Z AGT TAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 

y 

yi 
UP 
fll 



O 
ri i 
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FIGURE SO 

MLLLTLLLLLLLLKGSCLEWGLVGAQKVS SATDAP IRDWAFFPPS FLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 



amino acids 3-18 



Important features: 
Signal peptide: 

amino acids 1-15 

H Growth factor and cytokines receptors family: 

Q 
Q 

m 
w 

II! 
W 

ru 

O 

m 

Q 

ru 
o 

fit 
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FIGURE 31 

GTT TGAAT TCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGT CCCAGGCTACCAGT T 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
AT T T GCATCTGTT T T GATAAAT GATGT TGACACCCT CCACCGAAT T C T AAGTGGAATCATGTCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
T TCAGAT T CAT CACCACCCTT C T GGT TCACAT T T TCAT T T CAT TGGT TAT T TT GGGAT T G T T GT T 
TGTCTGCGGT GT TT TATGGTGGCTGTAT TATGACTATACCAACGACCTCAGCATAGAATT GGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
CTGCTCGTCTT GAT TTTTGTTCT C AGAAAGAGAAT AAAAT T GACAG T T G AGCT T T TC C AAAT CAC 
AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
TGGTTACTTGTTATTTCAACAGAAGTAA7yVATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
GAT T CCGAGAATCAT TGT CAT GTACAT GCAAAACGCAC T GAAAGAACAGCAGCAT GGT GCATT GT 
CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
AACCAGAATGCATATACTACAACTGCTATTAATGGGACAGATTTCTGTACATCAGCAAAAGATGC 
AT TCAAAATC T T GT CCAAGAAC T CAAGT CACT T T ACAT CT AT TAAC T GCT T TGGAGAC T TCATAA 
TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
TGGAAACAAAT GATGGAT CGTCAGAAAAGCC C T ACT T TATGGAT CAAGAAT T TCT GAGT TT CGTA 
AMAGGAGCAACAAATTAAACAATGCAAGGGCACAGCAGGACAAGCACTCATTAAGGAATGAGGA 
GGGAACAGAACTCCAGGCCAT TGTGAGATAGATACCCAT T T AGGTAT C TGT ACC T GGAAAACAT T 
T C C T T C T AAG AGC C AT T T AC AGAAT AG AAG AT GAG AC CAC T AG AGAAAAG T TAG T GAAT T T T T T T 
T TAAAAGACCTAATAAACCCTAT TC T T CC T CAAAA 



10063532 



Page 183 



FIGURE 32 

MSGRDTILGLCIIJ^iiALSLAlYM 

LDTERENMKCVLGFAI VS TG I TAVLLVL I FVLRKRI KLT VE L FQ I TNKAI S SAPFLL FQPLWT FA 
ILIFFWVLWVAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFILACQQMTIA 
GAWTCYFNRS KNDPPDHP I LS S L S I LFFYHQGTWKGS FL I SWR I PR 1 1 VMYMQNALKE QQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FI I FLGKVLWCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHSFLSVFETVLDALFLCFA 
VDLETNDGSSEKPYFMDQE FLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 

Important features: 
Signal peptide: 

amino acids 1-2 0 

Putative transmembrane domains: 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 

N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 33 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAATTTGTATGGGATTTGTGATGCAGGAAAGCCTAAGGGAAAAAGAATATTCATTCTGTGTGGT 
GAAAATTTTTTGAAAAAAAAATTGCCTTCTTO\AACAAGGGTGTCATTCTGATATTT ATGA GGAC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
ATT CAAACAAAGAA&CGGCAAAGAAGAT T AAAAGGC CCAAGT T CAC TGTGCCTCAGAT CAACT GC 
GATGTCAAAGCCGGAAAGATCATCGATCCTGAGTTCATTGTGAAATGTCCAGCAGGATGGCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 
GGTTACAAA.GGGAGTTATTCCAACGGTGTCCAATCGTTATCCCTACCACGATGGAGAGAATCCTT 
TATCGTCTTAGAAAGTAAACCCAAAAAGGGTGTAACCTACCCATCAGCTCTTACATACTCATCAT 

L.,v CGAAAAG T CCAGCT GCCCAAGCAGGT GAGACCACAAAAGCC TATCAGAGGCCACC TAT T CCAGGG 

.?«•=* 

' n ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
Jif CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 
Q GCCACAGGAGCCAGGAGATGGATCTCTGGTCCACTGCCACCTACACAAGCAGCCAAAACAGGCCC 
CP AGAGCTGATCCAGGTATCCAAAGGCAAGATCCTTCAGGAGCTGCCTTCCAGAAACCTGTTGGAGC 
III GGATGTCAGCC TGGGACT T GT TCCAAAAGAAGAAT T GAGCACACAGTC TT T GGAGCCAG TAT CCC 

Iff TGGGAGATCCAAACTGCAAAATTGACTTGTCGTTTTTAATTGATGGGAGCACCAGCATTGGCAAA 
~fl CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
*H CGGTCCACTGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACAC 
fll ACACGAAT TCTCGAGATCTGAAGACAGCCATAGAGAAAAT TAC TCAGAGAGGAGGACTT TCTAAT 

5 GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 
f«5 GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 

GACTTGCGAGAGAGTCAGGAATCAACATTTTCTTCATCACCATTGAAGGTGCTGCTGAAAATGAG 
AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACA7yVCGGCTTCTACTC 
P GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
f|| ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 
f| GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
»* AGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTGCAGTACACCTACGAACAGCGGC 
211 TGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACATCCTCAACGCCATCAAGAGGGTGGGC 

TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
GTCCAAGCCCAACAAGAGGAAGTTAATGATCCTCATCACCGACGGGAGGTCCTACGACGACGTCC 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGCTAGAAGTCATTGCCACTCACCCCGCCAGAGACCACTCCTTCTTTGTGGACGA 
GTTTGACAACCTCCATCAGTATGTCCCCAGGATCATCCAGAACATTTGTACAGAGTTCAACTCAC 
AGCCTCGGAAC TGAA T TCAGAGCAGGCAGAGCACCAGCAAGTGC TGCT TTACTAACTGACGTGTT 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAA.TGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 
T GAT CACAAAC G TATAGAAT GAGCCAAAAGGC T AC AT CAT GT T GAGG G T G C T GGAGAT T T T ACAT 
T TTGACAATTGT TT TCAAAATAAATGT TCGGAATACAGTGCAGCCCT TACGACAGGCTTACGTAG 
AGCT T T T GTGAGAT T T TT AAGT T GT TATT TC TGAT T T GAAC TC T GTAACCC TCAGCAAGT T TCAT 
T T T TGT CATGACAATGTAGGAAT TGC T GAAT TAAAT GT TT AGAAGGAT GAAAAATAAAAAAAAAA 
AAAAAAAkAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 34 

MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 
CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 
ES FI VLE SKPKKGVT YPSALT YS S SKSPAAQAGET TKAYQRPP I PGT TAQPVTLMQLLAVTVAVA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 
VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 
RSGAPNVVVVMVDGWPTDKVEEASRLARES 

FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKS KPNKRKLMIL I TDGRS YDDVRI PAMAAHLKGVI TYAI GVAWAAQEELEVI ATHPARDHS FF 
VDEFDNLHQYVPRI IQNICTEFNSQPRN 

Important features: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 
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FIGURE 35 



CCG AGC ACAGGAGATT GC CT GC GT T TAG GAGGT GGCT G CGTT GT GGGAAAAGCT AT CAAGGAAG AAAT T GC 
CAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATGTGAGTGTTTTAATTAAGCATGGAAT 
ACAGAAAACAACAAAAAACTTAAGCTTTAATTTCATCTGGAATTCCACAGTTTTCTTAGCTCCCTGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGG ATG GCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GG AT GTCACT GAGAT C CC TC AAAT GGAGCC TCCT GCTGCT GTCACTCC TGAGT T TCT TT G TG ATGT GG T AC 
CTC AGC CT T CCC C ACT AC AAT GTGAT AG AACGCGT GAACT GGAT GT AC TT CT AT GAGT AT GAGCCG AT TT A 
i: C AG AC AAGACTT TCAC TT C AC ACT TCGAGAGC ATTC AAACT GCT CT CAT CAAAATCC AT T TC T GGT C ATTC 

p| TGGT GACCTCCC ACCC TT C AGATGTGAAAGCC AGGC AGGCC ATT AGAGT T ACTT GGGGT G AAAAAAAGT CT 

fl TGGT GGGG AT AT GAGGTT CTTACATTTTTCTTATTAGGCCAAGAGGCTGAAAAGGAAGAC AAAATGTTGGC 

W str- 
ip ATTGTCCTT AGAGGAT GAACAC CT TCTT T AT GGT GAC AT AAT CCGACAAG AT TT TT TAG ACACAT AT AAT A 

Ul ACCTGACCTTGAAAACCATTATGGCATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATG 

V* AAGACAGACACT GATGTT TTC AT C AAT ACT GGCAAT TT AGT G AAGT AT CT T TT AAACCT AAACC ACTC AGA 



11 i: 



GAAGTT TTT C AC AGGT T ATCCT CT AAT T GAT AAT T ATT CCT AT AGAGGAT T TT ACC AAAAAAC CCAT AT TT 
CTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAATGTCCAGAGAT 
TTGGTGCCAAGGATCTATGAAATGATGGGTCACGTAAAACCCATCAAGTTTGAAGATGTTTATGTCGGGAT 

CT GT TT GAAT TT AT T AAAAGTGAAC ATT CAT AT T CC AGAAGACAC AAATCT T T T CT T TCT AT AT AG AAT CC 

Wi 

i*l AT T T GGAT GT CT GT C AAC TGAGACGTGT GATT GC AG CCCAT GGCTT T T CT T CCAAGGAGATC AT C ACT TTT 

f|| T GGC AGGT C ATGCT AAGG AACACC AC AT GC CAT TAT TAA CT T CAC AT T CT ACAAAAAGC C TAGAAGGACAG 

Q GAT ACC TTGT GGAAAGTGTT AAAT AAAG T AGGT ACT GT GGAAAAT T CATGGG GAGGT C AG TGT GC T GGCTT 

ACACTGAACT GAAACTCATGAAAAACCCAGACTGGAGACT GGAGGGTTACACTTGT GATT TATT AGTCAGG 
CCCT TC AAAG AT GAT ATGTGGAGG AAT T AAAT AT AAAGGAAT T GGAGGTT T TT GCT AAAG AAATT AAT AGG 
ACC.AAACAATTTGGACATGTCATTCTGTAGACTAGAATTTCTTAAAAGGGTGTTACTGAGTTATAAGCTCA 
CT AG GCTGTAAAAACAAAACAATGT AGAGT TTT ATT TAT T GAAC AATGTAGT CACT TG AAGGT TTT GT GT A 
TATCTTATGTGGATTACCAATTTAAAAATATATGTAGTTCTGTGTCAAAAAACTTCTTCACTGAAGTTATA 
C T GAAC AAAATT TT ACCT GT T T TTGGT C AT TT AT AAAGT ACTTCAAGATGT T GC AGT AT T TC AC AGTT AT T 
ATT ATT T AAAAT TACT TC AACT TTGTGT TT TT AAAT GT T T T GAC GATT TC AAT ACAAG AT AAAAAGGAT AG 
T GA ATC ATT C TT TACATGC AAACATTT T CC AGTT AC TTAACTGAT CAGTT T AT T ATTG AT AC ATC ACT C CA 
T T AATGTAAAGT CAT AGGT CAT T AT TGC AT ATC AGT AAT CTCT TGGACTT T GTT AAAT AT TT TACT GT GGT 
AAT AT AGAGAAG AAT T AAAGCAAG AAAATC TGAAAA 
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FIGURE 36 



MASMjWTVLPSRMSLRSLKWSLLLLSLL^ 

TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA 
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPL I DNYS YRGFYQKTHI S YQE YPFKVFPPYCSGLGYIMSRDLVPRI YEMMGHV 
KPIKFEDVYVGICLNLLKVNIHIPEDTNLFFLYRIHLDVCQLRRVIAAHGFSSKEIITFWQVMLR 
NTTCHY 



Important features : 
Type II transmembi 

amino acids 20-39 



Type II transmembrane domain: 



y| N-glycosylation sites. 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

FU 

f*| Glycosaminoglycan attachment site. 

Ill amino acids 239-243 



Ly-6 / u-PAR domain proteins. 



amino acids 23-37 

III 



N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



CGC TCGGGCACCAGCCGCGGCAAGGATGGAGCTGGGTTGC TGGACGCAGTTGGGGCTCACT TTTCTTCAGCTCCTTCTCATC 
TCGT CCTTGCCAAGAGAGTACAC AGTCAT TAATGAAGCC T GCCC TGGAGCAGAGTGGAATATC ATGTGTCGGGAGTGCT GT G 
AATATGATCAGATTGAGTGCGTCTGCCCCGGAAAGAGGGAAGTCGTGGGTTATACCATCCCTTGCTGCAGGAATGAGGAGAA 
TGAGTGTGACT CCTGC C TGAT CCACCCAGGT T GTAC CATC T TTGAAAAC TGCAAGAGC TGC CGAAAT GGCTC ATGGGGGGGT 
ACCTTGGATGACTTCTATGTGAAGGGGTTCTACTGTGCAGAGTGCCGAGCAGGCTGGTACGGAGGAGACTGCATGCGATGTG 
GCCAGGTTCTGCGAGCCCCAAAGGGTCAGATTTTGTTGGAAAGCTATCCCCTAAATGCTCACTGTGAATGGACCATTCATGC 
TAAACCTGGGT T TGTCATCCAAC TAAGAT TTGTCAT GT TGAGTC TGGAGTT T GACTACATGTGCCAGT ATGAC TATGTTGAG 
GT TCGTGATGGAGACAACCGCGAT GGCCAGAT CAT C AAGC GTG TC TGTGGC AAC GAGCGGCCAGCTC C TATC CAGAGCATAG 
GATC C TCAC TC C ACGTC CTC T TCCAC TCCGAT GGC T C C AAGAAT T TTGACGGT TT C CATGCCAT TTATGAGGAGATCAC AGC 
ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 
TAT AC TGGGCAGCGC TGTGAAAATC TC C T TGAAGAAAGAAACT GC TC AGAC C C TGGGGGC C CAGTCAATGGGTACCAGAAAA 
M TAACAGGGGGCCCTGGGCTTATCAACGGACGCCATGCTAAAATTGGCACCGTGGTGTCTTTCTTTTGTAACAACTCCTATGT 
U 5 T CT TAGTGGCAATGAGAAAAGAAC TTGC C AGCAGAAT GGAGAGT GGTCAGGGAAAC AGC C C ATC T GC ATAAAAGC CTGCCGA 

III GAACCAAAGAT T TCAGACC TGGTGAGAAGGAGAGT T C T TC CGATGCAGGTT C AGTCAAGGGAGACAC CATTACAC CAGC TAT 

Iff AC TCAGCGGCC T TCAGCAAGCAGAAACTGCAGAGT GC CCC T AC C AAGAAGC C AGCC C TTCCCTTT GGAGATC T GC CCATGGG 

111 ATACC AAC ATC T GCATACC CAGC TCCAGT AT GAGT GC ATC TC ACCCT TC TAC CGCC GCC TGGGC AGCAGCAGGAGGACATGT 

lH CTGAGGAC TGGGAAGTGGAGTGGGCGGGCACCATCCTGCATCCCTATCTGC GGGAAAATTGAGAACATCAC TGCTCCAAAGA 

s CCCAAGGGT TGC GCTGGCC GT GGCAGGC AGC CATC TACAGGAGGACC AGC GG GGTGCATGACGGCAGC CTACACAAGGGAGC 

p| GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 

GTCACCATGATCAAGACAGCAGACCTGAAAGTTGTTTTGGGGAAATTCTACCGGGATGATGACCGGGATGAGAAGACCATCC 



AGAGC C TACAGAT TT C TGC T ATCATTC TGCATCCCAAC TAT GACC CC ATC C T GCTT GATGC TGAC ATC GCC AT CC TGAAGC T 



, ..... 

rti 



CCTAGACAAGGCCCGTATCAGCACCCGAGTCCAGCCCATCTGCCTCGCTGCCAGTCGGGATCTCAGCACTTCCTTCCAGGAG 

T CCCACATCAC TG TGGC TGGC TGGAATGTCC TGGCAGACG TGAGGAGCCCT GGC T T C AAGAACGACAC AC T GC GC TCTGGGG 

T GGTC AGT GT GGTGGAC TCGC TGC TGTGTGAGGAGC AGCAT GAGGACC ATGGC ATCC CAGTGAGTGTC ACT GATAACATGT T 

CTGTGCCAGCTGGGAACCCACTGCCCCTTCTGATATCTGCACTGCAGAGACAGGAGGCATCGCGGCTGTGTCCTTCCCGGGA 

CGAGCAT C T CCT GAGCC ACGC TGGCAT C T GATGGGAC TGGT CAGC TGGAGC TAT GATAAAACATGCAGCCAC AGGC T C TCCA 

CTGCCTTCACCARGGTGCTGCCTTTTAAAGACTG^ATTGAAAGAAATATGAAATGRACCATGCTCATGCACTCCTTGAGAAG 

TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 

GTGCCAGGGCTTCTGACTTCAGGGACAAAACTCAGTGAAGGGTGAGTAGACCTCCATTGCTGGTAGGCTGATGCCGCGTCCA 

C TAC TAGGACAGC CAAT TGGAAGATGCC AGGGC TT GCAAGAAGTAAGT T TC T TCAAAGAAGACCATATACAAAAC CTC TCCA 

CTCCACTGACCTGGTGGTCTTCCCCAACTTTCAGTTATACGAATGCCATCAGCTTGACCAGGGAAGATCTGGGCTTCATGAG 

GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGCTGGGATGTGGTGCATGCCTT 

T GTGTACATGGC CACAGTACAGTC TGGTC CT TT TCC TTCCCCAT C TC T TGT AC ACAT T T TAATAAAATAAGGGTT GGCT T CT 

GAACTACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 38 
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MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQ I LLE S Y P LNAH CE W T I HAKP G FV I Q LR FVML S LE FD YMC Q YD YVE VRDGDNRDGQ 1 1 
KRVCGNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDGTCVLDKAGSYKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP 
KTQGLRW PWQAAI YRRT S GVHDGS LHKGAW FLVC S GALVNERT VWAAHCVT DLGKVTM IKT ADL 
KWLGKFYRDDDRDEKT I QSLQ I S AI I LHPNYDP I LLDAD I AI LKLLDKARI S TRVQP I CLAASR 
DLSTSFQESHITVAGWNVLADVRSPGFKNDTLRSGWSWDSLLCEEQHEDHGIPVSVTDNMFCA 
SWEPTAPSDICTAETGGIAAVSFPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
ERNMK 

Important features of the protein: 



Q Signal peptide: 



amino acids 1-23 
EGF-like domain cysteine pattern signature, 
amino acids 260-272 
N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 
474-480, 491-497, 638-644, 666-672 
Amidation site, 
amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 



APPJD=1 0063532 



Page 190 of 320 



FIGURE 3Q 



GGT T C CTACAT C CT CTC ATCT GAGAAT CAGAGAGCATAAT CT T CTTACGGGCC CGT GATTTATTAAC GTGGCTTAAT C 
TGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 
TTT GG GCCCTT GTAGCT GACAGAAGGT G GC C AGGGAGAAT GCAGCACACT GCT CGGAGA ATGA AGGCGCT T CTGTT G C 
TGGTCTTGCCTTGGCTCAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTGTA 
AAGGT GCCTCCCACTAC GGC C T GACCAAAGAT AGGAAGAG GCGCT CACAAGAT GGCT GT C CAGACGGCTGT G C GAGCC 
T CACAGCCAC GG CTCCCT CC CCAGAGGT TT CT GCAGCT GC CACCAT CT CCTTAAT GACAGAC GAGCCT GGCCTAGACA 
ACCCTGCCTACGTGTCCTCGGCAGAGGACGGGCAGCCAGCAATCAGCCCAGTGGACTCTGGCCGGAGCAACCGAACTA 
GGGCACG GCC CTT T GAGAGAT CCACTATT AGAAGCAGATCATTTAAAAAAATAAAT CGAGCTTTGAGT GTT CTTC GAA 
GGACAAAGAGC GGGAGT GCAGTT GC CAACCAT G CCGAC CAGG GCAG GGAAAATTCT GAAAACACCACT GC CC CTGAAG 
T C T TT C CAAG GTT GT AC C AC CT GATT C C AG AT GGT GAAAT T AC CAG CAT C AAGAT C AAT C G AGT AGAT C C C AG T GAAA 
G CCTCT CTATTAGGC T GGTGG GAGGTAG CGAAACCC CACT GGT CCAT AT CATTAT C CAACACAT TTAT CGT GATGGGG 
T GATC GCCAGAGACGGC CGGCT ACT GCCAGGAGACAT CATT CTAAAGGTCAAC GGGAT G GACATCAGCAAT GT CCCT C 
ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 
GCAGGAACAAT G GACAGGCC CCG GAT GCCT ACAGACC CCGAGAT GACAGCTTT CAT GT GATTCTCAACAAAAGTAGC C 
CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 
GTGTGGCATATCGACATGGTCAGCTTGAGGAGAATGACCGTGTGTTAGCCATCAATGGACATGATCTTCGATATGGCA 

GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 

S 

GGAGCC CT GACAT CTT T CAGGAAGC C GGCT GGAACAGCAAT GGCAG CT GGT CC C CAGGGC CAGG GGAGAGGAGCAACA 

Has? 

3 s? CTC CCAAGCC CCT CCAT C CT ACAATTACTT GT CAT GAGAAG GT GGTAAATATC CAAAAAGAC CC CGGT GAATCT CTC G 

iU I 

GCATGACCGTCGCAGGGGGAGCAT CACATAGAGAAT GGGATTTGCCTAT CTATGTCATCAGTGTT GAGCCCGGAGGAG 
Jy TCATAAGCAGAGATGGAAGAATAAAAACAGGTGACATTTTGTTGAATGTGGATGGGGTCGAACTGACAGAGGTCAGCC 
S>*i GGAGT GAG GC AGT GGCAT T AT TGAAAAGAACAT CAT CCT C GAT AGT ACT CAAAGCT TT GGAAGT CAAAGAGTAT GAGC 

HS CCCAGGAAGACTGCAGCAGCCCAGCAGCCCTGGACTCCAACCACAACATGGCCCCACCCAGTGACTGGTCCCCATCCT 

GGGT CATGTGGCT GGAAT T ACCACGGTGCT T GTATAACT GTAAAGATAT T GTATTACGAAGAAACACAG CT GGAAGT C 
T GGGCTTCT GCAT T GTAGGAGGT TAT GAAGAAT ACAAT GGAAACAAACCTTTTT T CAT CAAAT CCATT GTT GAAGGAA 
CACCAG CATACAAT GATGGAAGAATTAGAT GT GGT GATAT T CTT CTTGCTGTC AAT GGTAGAAGTACAT CAGGAATGA 
TACATGCTTGCTTGGCAAGACTGCTGAAAGAACTTAAAGGAAGAATTACTCTAACTATTGTTTCTTGGCCTGGCACTT 
TTTT ATAGA AT CAAT GAT GGGTCAGAGGAAAACAGAAAAAT CACAAATAGGCT AAGAAGTT GAAACACTATATT TATC 
TTGT CAGTTTT TATAT TTAAAGAAAGAATACATT GT AAAAATGT CAGGAAAAGTAT GATCATCTAATGAAAGCCAGTT 
ACACCT CAGAAAATAT GAT TCCAAAAAAATTAAAACTACT AGT TTTTTTT CAGT GT GGAG GATT T CTCATTACT CT AC 
AACATT GTTTATATT TTT T CTATT CAATAAAAAGCC CTAAAACAACTAAAAT GATT GAT TT GTATACC CCACT GAATT 
CAAGCT GAT TT AAAT TTAAAATT T GGTATAT GC T GAAGT CT GCCAAGGGTACAT TATGGCCATT TTTAATTTACAGCT 
AAAATATTTT T TAAAATGCAT TGCT GAGAAACGTT GCT TT CAT CAAACAAGAATAAATATT TTT C AGAAGT TAAA 
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FIGURE AO 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAAT I S LMTDE PGLDNPAYVS SAEDGQPAI S P VDS GRSNRTRARP FE RS T I RS RS FKKINR 
ALS VLRRTKS GS AVANHADQGRENS ENTT APEVFPRLYHL I PDGE I TS I KINRVDP SES LS I RLV 
GGSETPLVHI I IQHIYRDGVIARDGRLLPGDIILKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGVFIFNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEKWNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
IKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNHNMAPP 
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RIRGGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE 41 



ril 



ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCAACTTGAAGCTTT 
CTTGCCTGCAGTGAAGCAGAGAGATAGATATTATTCACGTAATAAAAAACATGGGCTTCAACCTGACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
ATAAGACCCTCATTTTGGGGAAGGGAAAAACTCTGACTAATGAAGCATCCACGAAGAAGGTAGAACTT 
GACAACTGTCCTTCT GTGT CTCCT TAC CT CAGAGGC CAGAGCAAGCT CATT TT CAAACCAGATCTCAC 
TTTGGAAGAGGTACAGGCAGAAAATCCCAAAGTGTCCAGAGGCCGGTATCGCCCTCAGGAATGTAAAG 
CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 
TAAAAAGTTTAATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTCAAGGAAGAAAATTGGG 
ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG 
CATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGTTACAGTGGATATTTTGG 
GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT 
G GGGAGGCGAAGACGAT GACC TCAGAC T CAGGGTTGAGC T CCAAAGAAT GAAAATT T CC C GGCCCCTG 
^ CCTGAAGT GGGTAAATATACAAT GGTCTTCCACACTAGAGACAAAGGCAATGAGGTGAACGCAGAACG 

O GATGAAGCTCTTACACCAAGTGTCACGAGTCTGGAGAACAGATGGGTTGAGTAGTTGTTCTTATAAAT 
111 T AGTATC T GTGGAACACAATCCT TTATATAT CAACAT CACAGT GGAT T TCTGGTTT GGT GCATGACCC 

y TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
J*J ATAGTAGCACACATTAAGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCT 
Sf 5 T AGCAGAGCTC CT GGT GATGTAGAGTATAAAACAGT TGTAACAAGACAGCTT T CTTAGT CATTT T GAT 

C ATGAGGG T TAAATAT T G TAATAT GGAT ACT T GAAGGAC T T TAT ATAAAAGG AT GAC T CAAAG G ATAA 
AATGAACGCTATTTGAGGAC T CT GGT T GAAGGAGAT T TAT TTAAATT TGAAGT AATATAT TATGGGAT 
AAAAGGCCACAGGAAATAAGACTGCTGAATGTCTGAGAGAACCAGAGTTGTT CTCGT CCAAGGTAGAA 
AGGTACGAAGATACAATACTGTTATTCATTTATCCTGTACAATCATCTGTGAAGTGGTGGTGTCAGGT 
GAGAAGGCGTC CACAAAAGAGGGGAGAAAAGGCGAC GAAT CAGGACACAGTGAACT T GGGAAT GAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 
T TTTAAAGAGTT T TTGTAAAATGAT TT T GTACAAGTAGGATAT GAATT AGCAGTTTACAAGTT T ACAT 
ATTAACTAATAATAAATATGTCTATCAAATACCTCTGTAGTAAAATGTGAAAAAGCAAAA 
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FIGURE 42 



MGFNLTFHLSYKFRLLLLLTLCLTWGWATSNYFVGAIQEIPKAKEFMANFHKTLILGKGKTLTN 
EASTKPCVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH 
RNREKHIMYLLEHLHPFLQRQQLDYGIWIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
S VEHNPLY INI TVDFW FGA 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 

amino acids 191-202 
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FIGURE 43 



GCT CAAGACCCAGCAGTGGGACAGC CAGACAGAC GGCAC GATGGCAC TGAGC T CCCAGATCT GGG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTGGATGCCCAT 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACG TAGA ACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AA2AAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE AA 

MALS S Q I WAACLLLLLLLAS LT S GS VFPQQTGQLAELQPQDRAGARAS WMPMFQRRRRRDTH FP I 
C I FCCGCCHRS KCGMCCKT 

Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 

N-myristoylation site. 

amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 
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FIGURE 4* 



GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATATGGCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
G AC C T T C AAC ACAAC CCCTCTTGT C ACC ATACAGC C AGAAGG GGGC AC TAT CAT AG T GAC C C AAA 
ATCGTAATAGGGAGAGAGTAGACTTCCCAGATGGAGGCTACTCCCTGAAGCTCAGCAAACTGAAG 
AAGAAT GACT CAGGGAT C TACTATG TGGGGATATACAGCT CAT CACTCCAGCAGCCC TCCACCCA ' 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
AT AAGAAT GGCACC TGTGTGACCAAT CT GACATGC TGCAT GGAACATGGGGAAGAGGATGTGAT T 
III TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 

III C T GGAGAT GGGGAGAAAGT GAT ATGACC T TCATCTGCG T T GCCAGGAACC C T G TCAGCAGAAAC T 

y s 

m TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 

II! GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 

s b 5 

Jj* G T T T CT GAAGAGAGAGAGACAAGAAGAGTACAT TGAAGAGAAGAAGAGAGT GGACAT TTG TCGGG 

lw AAAC T C C T AAC AT AT G C C C C CAT T C T G G AGAG AAC AC AG AG T AC GAC AC AAT C CC T C AC AC T AAT 

P| AGAACAAT C CTAAAGGAAGAT CCAGCAAATAC GGT T TAC T CCACT G T GGAAAT ACC GAAAAAGAT 

111 GGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTA 
T CTAGACAGCAG T GCAC TCCCC TAAGT C T CT GC TCA 
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FIGURE 46 

MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIWTFNTTPLVTIQP 
EGGT 1 1 VTQNRNRERVD FPDGGYSLKLSKLKKNDS G I Y YVGI YS S S LQQPS TQE YVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAMESHNGSILPISWRWGESDMTFIC 
VARNPVSRNFSSPILARKLCEGAA.DDPDSSMVXLCLLLVPLLLSLFVLGLFLWFLKRERQEEYIE 
EKKRVDI CRE T PNI C PHS GENTE YDT I PHTNRT I LKEDPANTVYS TVE I PKKMENPHS LLTMPDT 
PRLFAYENVI 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 

Leucine zipper pattern. 

amino acids 229-251 

N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE 47 

GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGTCTGATGGCCATTCCAGCAACAACAATGTCCTTGACAGCAAGAAAAAGAGCGTG 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAATGCCAAT TGT GAAT T T TC AT T GAAAAACATCAGTGACAT TCAT CCAGAATCCT TCAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
ACACCATGGCGAGTGGCTGGAGAGCATCTAGT TTCCACT TCGATTC TGAAGAAAACAAACATAGG 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
T GTAG T T TAATGGGAATAAAAT GTAAGT AT CAG TAGT T T GAAAAAAAAAAA 
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FIGURE 48 



MTCCEGWTSCNGFSLLVLLLLGWLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA 
TTMS LTARKRACCNNRTGMFL S S FFSVI TVI GAL YCML I S I QALLKGPLMCNS PSNSNANCE FS L 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 
LLVG I LEVLFGLSQ I VI G FLGCLCGVSKRRS Q IV 

Important features : 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE 4Q 

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAG T GAGGAACCT GCAGAGTCACACAGT T GC TGACCAAT TGAGCT G TGAGCC T GGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
C ATG GAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CAT T TGCCAATAAAGACGAT CCC T TC TAC TATGACT GGAAAAAC CTGCAGC TGAGCGGACT GAT C 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCACTCATCACTCCAGGCTCTGCCA 
CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 
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FIGURE so 
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MERVTLALLLLAGLTALEi^^ 

S QKQHS PVPEKAI PL I TPGSATTC 

Important features: 
Signal peptide: 

amino acids 1-1'6 

Transmembrane domain: 

amino acids 36-59 

N-myristoylation- sites. 

amino acids 41-47, 45-51, 84-90 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 
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FIGURE 51 

GTGGACTCTGAGAAGCCCAGGCAGTTGAGGACAGGAGAGAGAAGGCTGCAGACCCAGAGGGAGGG 
AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 
AGGAGAC CC TGGTGGGAGGAAGACAC T CTGGAGAGAGAGGGGGC TGGGCAGAGATGAAGT T CCAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
GCCCTTGGCCAAGGGAGCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
H AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 
S? T TGGCAGACAGGCAGAAGAT GT CAT T CGACAC GGAGCAGATGC TG T CCGCGGC TCC TGGCAGGGG 

y 

fft GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 

III CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 

111 GAAAC TCAGCAGGCAGCT T TGGAAT GAAT CCTCAGGGAGCT CC C T GGGGT CAAGGAGGCAAT GGA 

Jf 5 ? GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 

AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 

Q ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 

HI AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 

W T GGC G G CAG CAG T GG T GG C AGC AG T GG C AACAG T G G T G G C AG CAGAGG T GACAG C GGGAG T GAG T 

I!? CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 

Hi CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 

III 

CTTCAGAGGACAGGGAGT TT CCAGCAACAT GAGGGAAATAAGCAAAGAGGGCAATCGCCTCCT TG 
GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 
GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 
GAATT T TAAATCCAAGCTGGGTTTCAT CAACTGGGATGCCATAAACAAGGACCAGAGAAGCT CTC 
GCATCCCGTGACC TCCAGACAAGGAGC CACCAGAT TGGAT GGGAGCCCCCACAC T CCC T CC T T AA 
AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE *2 



MKFQGPIACLLIJUXLGSGEAGPLQSGEESTGTNI^ 

E7VLGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHEIGRQAEDVIRHGADAVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGWTVNSETSPGM 
FNFDTFWKNFKSKLGFINWDAINKDQRSSRIP 



Signal peptide: 
amino acids 1-21 



N-glycosylation site. 

amino acids 265-269 

Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 



Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 



N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE 53 



GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATCTCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
T TGCACCCAAGGATAATCT CT TCATCAGGT TCCTGAAGCCCT GGCTGGGAGAAGGGATACT GCTG 
!F : AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
j!f GAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACATCATGCTTGACAAGTGGCAGCACCTGG 
III CCTCAGAGGGCAGCAGTCGTCTGGACATGTTTGAGCACATCAGCCTCATGACCTTGGACAGTCTA 
||| CAGAAAT GCATC TTCAGCTTTGACAGCCATTGTCAGGAGAGGCCCAGT GAATATATTGCCACCAT 

Ifl C TT GGAGCT CAGTGCCCT T GTAGAGAAAAGAAGC CAGCAT ATCC TCCAGCACAT GGAC TT T C TG T 

^ ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 

111 

GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 

f«* AGC CAAGT CCAAGAC T T T GGAT T TCAT T GATGTGCT T C TG CT GAGCAAGGAT GAAGAT GGGAAGG 

|f| CATTGTCAGATGAGGATATAAGAGCAGAGGCTGACACCTTCATGTTTGGAGGCCATGACACCACG 

O GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 

j~ ACAGGAGGTGCAAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCC 

2? AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 

111 

TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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FIGURE 54 

MSLLSLPWLGLRPVi^SPWLLLLLWGSWLLARIIAWTYAFYNNCRRLQCFPQPPKRNWFWGHLG 
LITPTEEGLKDSTQMSATYSQGFTVWLGPIIPFIVLCHPDTIRSITNASAAIAPKDNLFIRFLKP 
WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 
SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 
LHP PAP FI SRCCTQDI VLPDGRVI PKG I TCL I DI I GVHHNPTVWPDPEVYDP FRFDPENSKGRS P 
LAFIPFSAGPRNCIGQAFAMAEMKWIAmLLHFRFLPDHTEPRRKLELIMRAEGGLW 

VGLQ 

Important features : 
Transmembrane domains: 

amino acids 13-32 (type II), 77-102 

Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 4 61-471 

N-glycosylation sites. 

amino acids 112-116, 168-172 
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FIGURE 55 



ATCGCATCAATTGGGAGTACCATCTTCCTCATGGGACCAGTGAAACAGCTGAAGCGAATGTTTGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAAT T CATGGC CAGT T T T ATGAAGC T T T GGAAGGCACTAT GGACAGAAGC T GGTGGACAG TT T T 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GTGATTCGAACAT T T GAGGGT T ACT T T TGGAAGCAACAAT ACAT TC T CGAACC TGAAT GT CAGT A 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
T C T GGAT GT T GT CCCACTGAATT CCCAT GAATACAAACCT AT TCAGCAACAGCAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 56 

MGPVKQLKBMFEPTRLIATIMVLLC^ 
DAVKKC FAVCLA 



Important features: 
Signal peptide: 

amino acids 1-33 

i= 

CI Type II fibronectin collagen-binding domain protein. 

T3 amino acids 30-72 

III 

III 
W 

ru 

5.' 

o 
in 

0 

m 
□ 
ru 
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FIGURE *7 

CGGCT CGAG C TCGAGCC GAATC GGCT C GAGGGGCAGT GGAGCACCCAGCAGGC CGCCAACMGCT CT GT CTGT G C CT G 

TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 

ATTTTCAAGCTCAGTGTCTTCATCCCCTCCCAGGAATTCTCCACCTACCGCCAGTGGAAGCAGAAAATTGTACAAGCT 

GGAGAT AAGGAC CTT GAT GGGCAGCTAGACTTT GAAGAATTT GT CCATTAT CT CCAAGAT CAT GAGAAGAAGCT GAGG 

CT G GT GTTTAAGATTTT GGACAAAAAGAAT GAT GGAC GCATT GAC GCG CAGGAGAT CAT GCAGT CCCT GCGGGACTT G 

GGAGT CAAGATATCT GAACAGCAGGCAGAAAAAAT T CT CAAGAGCAT G GAT AAAAACGGCACGAT GAC CATC GACTG G 

AACGAGTGGAGAGACTACCACCTCCTCCACCCCGTGGAAAACATCCCCGAGATCATCCTCTACTGGAAGCATTCCACG 

AT CT TTGAT GTGGGT GAGAATCTAACG GT CC C GGAT GAGTT CACAGT GGAG GAGAGGCAGAC GGGGAT GT GGTGGAGA 

CAC CT GGT GGCAG GAGGT GGGGCAGGGGCCGTAT C CAGAAC CT GCAC GGC C CC CC T GGAC AGGCT CAAGGTGCTCAT G 

CAGGTCCATGCCTCCCGCAGCAACAACATGGGCATCGTTGGTGGCTTCACTCAGATGATTCGAGAAGGAGGGGCCAGG 

TCACTCT G GC GGGGCAAT GGCAT CAACGTC CT CAAAATTGC C C CCGAAT C AGC CAT CAAAT T CAT G GCCT AT GAGCAG 

AT CAAGCGCCTT GTT GGTAGT GACCAGGAGACT CT GAG GATT C ACGAGAGGCTTGT G GCAGGGT C C T TGG CAGGGGCC 

AT C G CCCAGAGCAGCAT CTACC CAAT GGAGGT C CT GAAGACC C GGAT GGC GCT GC G GAAGACAGGCCAGTACT CAGGA 

ATGCTGGACTGCGCCAGGAGGATCCTGGCCAGAGAGGGGGTGGCCGCCTTCTACAAAGGCTATGTCCCCAACATGCTG 

GG CATCAT CC CCTAT GC CGGCATCGACCTT G CAGT CTACGAGACGCT CAAGAAT GCCT GGCT G CAGC ACT AT GCAGT G 

AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 

C CC CT GGC CCTAGTCAGGAC CC GGAT GCAGGCGCAAGCCT CT ATT GAGGG C GCT CC GGAGGT GACCAT GAGCAGCCT C 

TTCAAACATAT C CTGC GGAC CGAGGGGGCCTT CGGGCT GTACAGGGG GCT GGCCC CCAACTT CAT GAAGGT CAT CC CA 

GCT GT GAG CAT CAGCTACGT GGT CTACGAGAACCT GAAGATCACCCT GGGCGT GCAGT CGC GGTGACGGG GGGAGGGC 

C GC CC GGCAGT GGACT C GCT GAT CCT GGGCC GCAGCCT GGGGT GT GCAGC CAT CT CAT TCT GT GAAT GT GCCAACACT 

AAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGTGGGGAGAGCTGGCAGGCCCAGGGCTT 

GTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCCAGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGG 

CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT 

TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 

CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 

ATAATCCATGATGAAAGGTGAGGTCACGTGGCCTCCCAGGCCTGACTTCCCAACCTACAGCATTGACGCCAACTTGGC 

TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 

CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 

GGCCTGGACCCTGTCAGGATGGGCCCCACCTCAGAACCAAACTCACTGTCCCCACTGTGGCATGAGGGCAGTGGAGCA 

CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG 

ACT GTT GGGAAAAG GGTT TT GT CCAGAAGGACAAGCCGGACAAAT GAGC GACTTCT GT GCT TCCAGAG GAAGAC GAGG 

GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 

G GGAC CAGCC CCACATTCCACTT GT GT CACT GCTT GGAACCT AT TT ATT TT GT ATT TATTT GAACAGAGTTAT GT CCT 

AACT AT TTTTAT AGATTT GTT T AATT AATAG CT T GT CATTTT CAAGTT CATTTTTTAT T CATAT TT AT GTT CAT GGTT 

GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACATCT 

GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 

GTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGACTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAAT 

AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 

TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 

CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 

T GAGGAAAT CT TCAATAGGAT GCAAAGAT CAAT GCAAAAATTGT T ATATAT GAACATAT AACT GGAGT C GT CAAAAAG 

CAAATTAAGAAAGAATTGGACGTTAGAAGTT GT CATTT AAAGCAG CCTT CT AATAAAGTT GTT T CAAAGCT GAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 58 

MLCLCL YVPVI GEAQTE FQYFE SKGLPAELKS I FKLS VFI PS QE FS T YRQWKQK I VQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILKSMDKNG 
TMT I DWNEWRDYHLLHPVENI PE 1 1 L YWKHST I FDVGENLTVPDEFTVEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVmQVHASRSNNMGIVGGFTQMIREGGARSLWRGNGINVLKIAPESAIK 
FMAYEQIKRLVGSDQETLRIHERLVAGSIAGAIAQSSIYPMEVLKTRMALRKTGQYSGMLDCARR 
I LAREGVAAFYKGYVPNMLG 1 1 PYAG I DLAWETLKNAWLQHYAVNSADPGVFVLLACGTMS S TC 
GQLA.S Y PLALVRTRMQAQAS I EGAPEVTMS SL FKHI LRTEGAFGLYRGLAPN FMKV I PAVS I S YV 
VYENLKITLGVQSR 

Important features : 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature, 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE 



GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGC 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GGGAACATTGGGGAGGATGGAATCCTGAGCTGCACTTTTGAACCTGACATCAAACTTTCTGATAT 
CGTGATACAATGGCTGAAGGAAGGTGTTTTAGGCTTGGTCCATGAGTTCAAAGAAGGCAAAGATG 
AGCT GT CGGAGCAGGAT GAAAT GT T CAGAGGCCGGACAGCAGT GT TTGCT GAT CAAGT GAT AGT T 
GGC.AATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
CATCAC TT C T AAAGGCAAGGGGAATGCT AACCT T GAGTAT AAAACT GGAGCCT TCAGCAT GC CGG 



||| AAGTGAATGTGGAGTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 

|y CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 

III CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 

¥J TCAACAACACATACTCCTGTATGATTGAAAATGACATTGCCAAAGCAACAGGGGATATCAAAGTG 

ill 

ACAGAATCGGAGATCAAAAGGCGGAGTCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGT 

s 

pi CTC'TTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAA ATAA T 

|f| GTGCCT TGGCCACAAAAAAGCATGCAAAGT CAT TGT TACAACAGGGAT CT ACAGAACTATT TCAC 

13 CACCAGATAT GAC C TAGT T T TATAT T TCTGGGAGGAAAT GAAT T CAT AT CT AGAAGTCT GGAG T G 

m 

AGCAAACAAGAGCAAGAAACAAAAAGAAGC CAAAAGCAGAAGGC TCCAATAT GAACAAGAT AAAT 

r? 



CTATCT TCAAAGACATAT TAGAAGTTGGGAAAATAATTCATGTGAACTAGACAAGT GT GT TAAGA 
GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGC 
TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
TCTTCCCAACTGACAAATGCCAAAGTTGAGAAAAATGATCATAATTTTAGCATAAACAGAGCAGT 
CGGGGAC ACCGAT T T T AT AAATAAAC T GAGC ACC T TCT T T T T AAAC AAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAA?^AA7\AAAAAAA 
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FIGURE 60 

MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 
DI VI QWLKE GVLGLVHE FKE GKDE L S E QDEMFRGRTAVFADQVI VGNAS LRLKNVQL TDAG T YKC 
Y 1 1 T S KGKGNANLE YKTGAFSMPE VNVD YNAS SET LRCE APR WFPQP TWWAS QVDQGANFSEVS 
NTSFELNSEWTMKWSVLYOTTINNTYSCMIENDIAKATGDIKVTESEIKRRSHLQLLNSKASL 
CVS S FFAI S WALLPLS PYLMLK 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 258-281 

N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 

N-myristoylation sites. 

amino acids 52-58, 126-132, 188-194 
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FIGURE 61 



TGACGTCAGAATCACCATGGCCAGCTATCCTTACCGGCAGGGCTGCCCAGGAGCTGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTGTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 

t* k GGTGGACTCAGATCACAGTGGCTATATCTCCATGAAGGAGCTAAAGCAGGCCCTGGTCAACTGCA 

P! 

ATTGGT CTTCATTCAATGAT GAGACCTGCCTCATGATGAT AAACATGT TT GACAAGACCAAGTCA 
GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAT^ATTCATCCAGCAGTGGAAGAACCTCTT 
CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 
AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCTCAGCTTCGAGGACTTCGTCA 



e 



f*S CCATGACAGCTTCTCGGATGCT ATGA CCCAACCATCTGTGGAGAGTGGAGTGCACCAGGGACCTT 
|f| TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 

O ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 

Pi I 

AATAGT GAGGAC CGGGGCT GAGGCCACACAGATAGGGGCC TGAT GGAGGAGAGGATAGAAGT T GA 

ms ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 

III 

AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
CTGACCAAGCATGAGAGAGATCTGTCTATGGGACCAGTGGCTTGGATTCTGCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
ATTTGGGGCCAAAAGTCCAGTGAAATTGTAAGCTTCAATAAAAGGATGAAACTCTGA 
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FIGURE 62 



M&SYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGYISMKELKQALWCNWSSFNDETCLMMINMFDKTKSGRIDWGFSALWK 

DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAV QGNIRLS FEDFVTMTASRML 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 63 



CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 

TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 

AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 

TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 

GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATGGTGGGGGCTGCCTGCCATG 

GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 

AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 

TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 

TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 

CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 

TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 

ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG 

CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 

GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 

CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 

GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 

TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 

T GC TCATC T CCAATAAAAT AAAAGCACT TAT GAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAA&AAAAAAAAAPAAAAAA 



FIGURE 6d 



MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 

DPRSNDIJ^VPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMDSWPPEDPW^ 

LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 

PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 

WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 

N-myristoylation sites. 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 



ru 
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FIGURE 6s 



AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 



APP ID=1 0063532 



Page 217 



FIGURE 66 



MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARSQHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 



N-glycosylation site. 



CP 



amino acids 37-41 




N-myristoylation sites. 

amino acids 15-21, 19-25, 60-66 
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FIGURE 67 



ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
lZ GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
|S| GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
ffl GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
III GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
H! AGACTCTGCCCCTGAGGTCAAGAGAGGATGGGGCTATTCACTTTTATATATTTATATAAAATTAG 

£f TAGT GAGAT GTAAAAAAAAAAAAAAAAAA 

III 



Pi 

ru 

fll ' 
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FIGURE 68 



MANPGLGLLLALGLP FLLARWGRA.WGQI QTT S ANENST VLPS S T S S S S DGNLRPEAI T AI I WFS 
LLAALLLAVGLALLVRKLRE^ 

Important features: 
Signal peptide: 

amino acids 1-19 



P Transmembrane domain: 

fl! amino acids 56-80 

in 

N-glycosylation site. 

5 :: 5 

?»s amino acids 36-40 

i II 

aS 

Q cAMP- and cGMP-dependent protein kinase phosphorylation site. 



amino acids 86-90 



Tyrosine kinase phosphorylation site. 



amino acids 86-94 

N-myristoylation sites. 

amino acids 7-13, 26-32 
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FIGURE 6q 



GC CAGGAATAACTAGAGAGGAACAATGGGGTTATT C AGAG GTTT T GTT TT CCT CTTAGTT CT GT GCCT GCT GCACCAG 
TCAAATACTTCCTTCATTAAGCTGAATAATAATGGCTTTGAAGATATTGTCATTGTTATAGATCCTAGTGTGCCAGAA 
GAT GAAAAAATAAT T GAACAAATAGAGGAT AT GGT GACT AC AGCT TCTAC GTAC CTGTTT GAAGCCACAGAAAAAAGA 
TTTTTTTTCAAAAATGTATCTATATTAATTCCTGAGAATTGGAAGGAAAATCCTCAGTACAAAAGGCCAAAACATGAA 
AACCATAAACATGCTGATGTTATAGTTGCAC CAC C TACACTCC CAGGTAGAGAT GAAC CATACACCAAGCAGTTCACA 
GAAT GT GGAGAGAAAGGCGAAT ACATT CACTT CACCCCT GACCT T CTACTTGGAAAAAAACAAAAT GAATAT GGACCA 
CCAGGCAAACTGTTTGTCCATGAGTGGGCTCACCTCCGGTGGGGAGTGTTTGATGAGTACAATGAAGATCAGCCTTTC 
TACCGTGCTAAGTCAAAAAAAATCGAAGCAACAAGGT GTTCCGCAGGTAT CT CTGGTAGAAATAGAGTTTATAAGTGT 
CAAGGAGGCAGCT GT CTTAGTAGAGCAT GCAGAAT T GATT CT ACAACAAAACT GT AT GGAAAAGATT GT CAAT TCTTT 
CCTGATAAAGTACAAACAGAAAAAGCATCCATAAT GT TTATGCAAAGTATT GAT T CT GT T GTT GAATT TT GT AACGAA 
J..^ AAAAC CCATAAT CAAGAAGCTC CAAGCCT ACAAAACAT AAAGT GCAATTTTAGAAGTACATG GGAGGT GATTAGCAAT 

lZ T CT GAGGAT TTTAAAAACAC CATACCCAT GGT GACACCACCT CCT C CACCTGTCTTCTCATTGCTGAAGATCAGTCAA 

CI AGAATT GTGTGCTTAGTTCTT GATAAGTCT GGAAGCATGGGGGGT AAGGACC GCCTAAAT CGAAT GAAT CAAGCAGCA 

p| AAACATTTCCTGCTGCAGACTGTTGAAAATGGATCCTGGGTGGGGATGGTTCACTTTGATAGTACTGCCACTATTGTA 
AATAAGCT AATC CAAATAAAAAGCAGTGATGAAAGAAACACACTCAT GGCAGGAT TACC TACATAT C CT CTGGGAGGA 
|f! ACTT CCATCT GCT CTGGAAT T AAATAT GCATT TCAGGT GATT GGAGAGCT ACAT T CCCAACT CGAT GGAT CCGAAGT A 

| s | CTGCTGCT GACT GAT GG GGAGGATAACACT GC AAGTT CTT GTATT GATGAAGTGAAACAAAGT GGGGCCATTGTTCAT 

TTTATTGCTTTGGGAAGAGCTGCTGATGAAGCAGTAATAGAGATGAGCAAGATAACAGGAGGAAGTCATTTTTATGTT 
y 1 T CAGAT GAAGCT CAGAACAAT GGCCT CAT T GATGCTT TT GGGGCT CTT ACAT C AGGAAATACT GAT CT CT CC CAGAAG 

|s | T CCCT T CAGCT C GAAAGT AAGG GATTAAC ACTGAATAGTAATGCCTGGATGAACGACACT GTCATAAT T GATAGTACA 

GTGGGAAAGGACACGTTCTTTCTCATCACATGGAACAGTCTGCCTCCCAGTATTTCTCTCTGGGATCCCAGTGGAACA 
i'W AT AATGGAAAATTT CACAGT GGAT GCAACT T C CAAAAT GGC CTAT CT CAGTATT C CAGGAACTGCAAAGGTGGGCACT 

T GGGCATACAATCTTCAAGCCAAAGCGAACCCAGAAACAT TAACTAT TACAGTAACTTCT CGAG C AGCAAATT CTTCT 
GTGCCTCCAATCACAGTGAATGCTAAAATGAATAAGGACGTAAACAGTTTCCCCAGCCCAATGATTGTTTACGCAGAA 
ATT CTACAAGGATATGT ACCT GTT CT T GGAGC CAAT GTGACT GCT TT CATT GAATCACAGAATGGACATACAGAAGTT 
TTGGAACTTTTGGATAATGGTGCAGGCGCTGATTCTTTCAAGAATGATGGAGTCTACTCCAGGTATTTTACAGCATAT 
ACAGAAAAT GGCAGATATAGCTTAAAAGT T CG GGCT CAT GGAGGAG CAAACACT GCCAG GCT AAAATT AC GGC CT CCA 
CT GAAT AGAGCC GCGTACATACCAGGCTGGGTAGT GAAC GGG GAAAT TGAAGCAAAC CC GCCAAGAC CT GAAATT GAT 
GAGGATACTCAGACCACCTTGGAGGATTTCAGCCGAACAGCATCCGGAGGTGCATTTGTGGTATCACAAGTCCCAAGC 
CTTCCCTTGCCTGACCAATACCCACCAAGTCAAATCACAGACCTTGATGCCACAGTTCATGAGGATAAGATTATTCTT 
ACATGGACAGCACCAGGAGATAATTT T GAT GTTGGAAAAGT T CAAC GTT AT AT CATAAGAAT AAGT GCAAGT ATT CTT 
fl | GATCTAAGAGACAGTT TT GAT GAT GCT CT T CAAGT AAATACT ACT GATCT GT CAC CAAAGGAGGC CAACT CCAAGGAA 

AGCT TT GCATTT AAAC C AGAAAATAT CTCAGAAGAAAATGCAACCCACATAT TTAT TGCCATTAAAAGTATAGATAAA 
AGCAATTT GACATCAAAAGTAT CCAACATT GCACAAGTAACT TT GT T TAT C CCT CAAGCAAATCCTGAT GAC ATT GAT 
C CTACACCTACT C CT ACT CCTACT CCTACTCCT GAT AAAAGT CAT AATT CT GGAGTTAATATTT CTACGCTGGTATTG 
TCTGTGATTGGGTCTGTTGTAATTGTTAACTTTATTTTAAGTACCACCATTTGAACCTTAACGAAGAAAAAAATCTTC 
AAGTAGAC CTAGAAGAGAGTT TTAAAAAACAAAACAATGT AAGT AAAGGAT ATTT CT GAAT CTT AAAATT CAT CCCAT 
GT GT GAT CATAAACTCATAAAAATAAT TTT AAGAT GT CGGAAAAGGATACTTTGATTAAATAAAAACACTCATGGATA 
TGTAAAAACTGTCAAGATTAAAATTTAATAGTTTCATTTATTTGTTATTTTATTTGTAAGAAATAGTGATGAACAAAG 
ATCCTTTTTCATACTGATACCTGGTTGTATATTATTTGATGCAACAGTTTTCTGAAATGATATTTCAAATTGCATCAA 
GAAATTAAAATCATCTATCTGAGTAGTCAAAATACAAGTA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 7Q 



MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHEMHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGS CLSRACRI DS T TKL YGKDCQ FFPDKVQTEKAS IMFMQS I D S WE FCNEKTHNQE AP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFTVSDEAQl^GLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTVGKDTFFL 
ITWNSLPPSISLWDPSGTIMENFTVDATSKMAYLSIPGTAKVGTWAYNLQAKANPETLTITVTSR 
^; AANSSVPPI TVNAKMNKDVNS FPSPMIVYAE ILQGYVPVLGANVTAFIESQNGHTEVLELLDMGA 

GADSFKNDGVYSRYFTAYTENGRYSLKVRAHGGANTARL^^ 

RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRYI IRISAS ILDLRDS FDDALQVNT TDLS PKEANSKES FAFKPENI SEENATHI FIAI 
KS IDKSNLT SKVSNIAQVTLFI PQANPDDI DPT PT PT PT PT PDKSHNSGVNI S TLVLSVI GSWI 
VNFILSTTI 



111 

3 -A 
SIX s 

m 



Signal peptide: 

amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



CTCCTTAGGTGGAAACCCTGGGAGTAGAGTACTGACAGCAAAGACCGGGAAAGACCATACGTCCCCGGGCAGGGGTGA 
CAACAGGT GT CATCTT TTT GAT CTC GT GT GT GGCT GCCTT CCTATTT CAAGGAAAGACG C CAAGGTAATTTT GAC CCA 
GAGGAGCAAT GATGTAGCCACCT C CTAAC CTT CCCTT CTT GAACCCC CAGTTAT GCCAGGATT T ACTAGAGAGT GTCA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GT GCT CAGAAGT GACT T GC CT GAGGGT GGACCAGAAGAAAGGAAAGGT CC CCT CT T G C T GT T GGCTGCACAT CAG GAA 
GG CT GTGAT GGGAAT GAAGGTGAAAACTT GGAGATTT CACTT C AGTCATT GCTT CT G CCT GC AAGATCAT CCTTTAAA 
AGT AGAGAAGCT GCT CT GT GTGGT GGTTAACT CCAAGAGGCAGAACT C GTT CTAGAAGGAAAT GGAT GC AAGCAGCT C 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAGGAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCAC 
C G CAACTAC GT GAGCAGCCT GAAG CGGCAGAT CGCACAGCT CAAGGAGGAGCT GCAGGAGAGGAGT GAGCAG C T CAGG 
AAT G GGCAGTAC CAAGC CAG CGAT GCT GCT GGCCT GGGTCT GGACAG GAGCC C CC CAGAGAAAACCCAGGCC GACCT C 
CTGGCCTTCCT GCACT C GCAGGTGGACAAGGCAGAGGT GAAT GCT GGC GT CAAGC T GGCC ACAGAGTAT GCAGCAGT G 
CCTTT CGATAGCTTTACTCTACAGAAGGTGTACCAGCTGGAGACTGGCCTTACCCGCCAC CCCGAGGAGAAGCCTGT G 
AG GAAGGACAAGC GGGATGAGTT GGT GGAAGC CATT GAAT CAGCCTT G GAGAC CCT GAACAAT CCT GCAGAGAACAG C 
C C CAATCACCGT C CT TACAC GGCCT CT GAT TT CATAGAAGGGATCT AC CGAACAGAAAGGGACAAAGGGACAT T GTAT 
GAGCT CAC CT T CAAAGGGGACCACAAACACGAATT CAAACGGCTCAT CTT ATT T C GACCAT T CAGCC CCAT CAT GAAA 
GTGAAAAATGAAAAGCT CAACATGGCCAAGACGCTTATCAATGTTATCGTGCCTCTAGCAAAAAGGGTGGACAAGTT C 
C GGCAGTT CAT GCAGAATTT CAGGGAGATGT GCAT T GAGCAGGAT GGGAGAGT CCATC T CACT GTT GT TTACTTT GGG 
AAAGAAGAAATAAAT GAAGT CAAAGGAATACTT GAAAACACT T CCAAAGCT GC CAAC T TCAGGAACTT TACCT TCAT C 
CAGCT GAATGGAGAATTTTCT C GGG GAAAGGGACTT GATGT T G GAGCC CGCT T CT G GAAGGGAAGCAACGT C CTT CT C 
TT TTT CT GTGAT GTGGACAT CT ACTT CACAT CT GAAT T C CT CAAT ACGT GT AGGCT GAAT ACACAGC CAGGGAAGAAG 
GTATTTTATCCAGTT CTTTTCAGTCAGTACAATCCT GGCATAATATACGGCCACCATGATGCAGTCCCT CCCTT GGAA 
CAGCAGCTGGT CATAAAGAAGGAAACTGGATTTTGGAGAGACTTTGGATTTGGGAT GACGT GT CAGTAT CGGTCAGAC 
TTCAT CAATATAGGTGGGTTTGATCT GGACATCAAAGGCT GGGGCGGAGAGGATGT GCACCTTTATCGCAAGTAT CTC 
CACAG CAACCT CATAGT G GTAC GGAC GCCT GTGCGAGGACT CTT C CAC C T CT G GCATGAGAAGC GCT G CAT GGACGAG 
CTGAC CCCC GAGCAGTACAAGAT GT GCAT GCAGT CCAAGGCCAT GAAC GAGGCAT C CCAC GGC CAGCT GGGCAT GCTG 
GT GTT CAG GCACGAGATAGAGGCT CACCT T C GCAAACAGAAAC^GAAGAC AAGT AGCAAAAAAACATGAACTC C CAGA 
GAAGGATTGT GGGAGACACTT TTT CT TT C CTTTT GCAATT ACT GAAAGT GGCT GCAACAGAGAAAAGACTT CCAT AAA 
GGACGACAAAAGAATTGGACTGAT GGGTCAGAGATGAGAAAGCCTCCGATTTCT CTCTGTT GGGCTTTTTACAACAGA 
AAT CAAAAT CT CC GCT TT GC CTGCAAAAGTAAC CCAGT T GCAC CCT GT GAAGT GT CTGACAAAGGCAGAAT GCTT GT G 
AGATTATAAGCCTAATGGTGTGGAGGTTTTGATGGTGTTTACAATACACTGAGACCTGTTGTTTTGTGTGCTCATTGA 
AAT ATT CAT GATTTAAGAGC AGTTTT GT AAAAAATT CATTAGCAT GAAAGGCAAG CAT ATT TCT CCT CAT AT GAATGA 
GCCTAT CAGCAGGGCT CT AGT TT CTAGGAAT GCTAAAATAT CAGAAG GCAG GAGAGGAGAT AGGCTTAT TAT GAT ACT 
AGTGAGTACATTAAGTAAAATAAAAT GGACCAGAAAAGAAAAGAAACCATAAATAT CGTGTCATATTTTCCCCAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TT T TTT CCCTT GT GAGTT ATAGT CTG CT T AT TT AATTACCACTTTGCAAGC CTTACAAGAGAGCACAAGTT GGC CTAC 
ATTT TTAT ATT TTTT AAGAAGATACT TT GAGAT GCAT TAT GAGAACT TT CAGTT CAAAGCAT CAAATT GAT GC CATAT 
C CAAGGACAT GC CAAAT GCTGATT CT GT CAGGCACT GAATGT CAGGCAT T GAGACAT AGGGAAGGAAT GGT TT GTACT 
AAT ACAGACGTACAGATACTT T CT CT GAAGAGT ATT TT C GAAGAGGAGCAACT GAACACT GGAGGAAAAGAAAAT GAC 
ACTTTCTGCTTTACAGAAAAGGAAACTCATTCAGACTGGTGATATCGTGATGTACCTAAAAGTCAGAAACCACATTTT 
CT C CT CAGAAGT AGGGACC GCTTT CTTAC CT GT TTAAAT AAAC CAAAGTATACC GT GT GAACCAAACAATCT CTTT T C 
AAAACAGGGTGCT CCTC CT GGCT T CT GGCTT CCATAAGAAGAAAT GGAGAAAAAT ATAT ATAT ATATATATATATT GT 
GAAAGAT CAATCCAT CT GCCAGAAT CTAGTGGGAT GGAAGT TTTTGCT ACAT GT TAT CCACC CCAGG C CAGGT GGAAG 
TAACTGAATTATTT TTTAAATTAAGCAGTT CTACTCAAT CACCAAGATGCTT CT GAAAATT GCATTTTATTACCATTT 
C AAACTATT TT TTAAAAATAAAT ACAGTTAACATAGAGT GGTTT CTT CATT CAT GT GAAAATTATT AGC CAGCACCAG 
AT G CAT GAGCT AATTAT CT CT TT GAGT CCTT GCTTCT GTTT G CT CACAGTAAACTCATT GT TT AAAAGCTT CAAGAAC 
ATTCAAGCT GTT GGTGT GT TAAAAAAT GCATT GTATT GATTT GTACT GGT AGTT TAT GAAAT TT AATT AAAACACAGG 
C CAT GAATGGAAGGTGGT ATT GCACAGCTAAT AAAAT AT GAT TT GT GGAT AT GAA 



FIGURE 72 



MMMTORGLLA^ 

EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLIAFLHSQVDK 
AEWAGWIATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 
ENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKLNMAN 
TLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTVVYFGKEEINEVKGILENTSKAANFRNF 
TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 
KYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHL 

RKQKQKTS SKKT 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328 

N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 



APP_ID=1 0063532 



Page 



FIGURE 73 



o 

■l-!sr 



GAGACTGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGATTTGTCCTGGGGATCCA 
GAAACCCAT GATACCCT AC TGAACACCGAAT CCCC TGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
4 CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTAT GAGGGCCCACAT GG TCAGGACCAT TGGCCAGC CT CT TACCC TGAG T G T 
GGAAACAATGCCCAGTCGCCCATCGATATTCAGACAGACAGTGTGACATTTGACCCTGATTTGCC 
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
||| GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 
TGAAGCCACATT TGCAGAGCTCCACATTGTACAT TATGACTCTGATTCCTATGACAGCT TGAGTG 
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
AATATAGC T TATGAACACAT TC TGAG T CACT TGCATGAAGT CAGGCATAAAGATCAGAAGACC T C 

g 

m AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
Iff GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
O AT TTCAAT GGAACAGCTGGAAAAGCT T CAGGGGACATTGT TCTCCACAGAAGAGGAGCCCTCTAA 

tjf GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
St TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 

TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
CCGAAAGAGT GTGG T CT T C AC C T CAGCACAAGCCACGAC T G AGGC ATAAAT TCC T TC TCAGATAC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACTGTAGGAGTAGTAAGCAGATGTCCTCCTTCCCCTGGACATCTCTTAGAGAGGAAT 
GGACCCAGGCTGTCATTCCAGGAAGAACTGCAGAGCCTTCAGCCTCTCCAAACATGTAGGAGGAA 
ATGAGGAAAT CGCTGTGTTGTTAATGCAGAGANCAAACT CT GT T TAGTTGCAGGGGAAGT TTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
CTCCTTAGGATAAAGAGT TGCTGTTGAAGTTGTATAT TT TT GATCAATATAT TTGGAAATTAAAG 
TTTCTGACTTT 
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FIGURE 74 



MLFS ALLLEVI WI LAADGGQHWT YEGPHGQDHWPAS YPECGNNAQS P I D I QTDS VT FDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFI QAGSS YTTGEMLSLGVGI LVGCLCLLIAVYFIARKIRKKRLENRKS 
WFTSAQATTEA 



o 

Q Important features of the protein: 

If I Signal peptide: 



ammo acids 1-15 

s H 

Ill 

*is Transmembrane domain: 

I hi 

= amino acids 291-310 

^; N-glycosylation site. 

amino acids 213-216 

5 

f|| Eukaryo tic- type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE 7fi 



TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCTGGTGGTT TGCCTAAACCTGCAAACATCACC TTCT TATCCATCAACATGAAGAATGT CC TACA 
ATGGACTCCACCAGAGGGTCTTCAAGGAGTTAAAGTTACTTACACTGTGCAGTATTTCATCACAA 
AT T GGCCCAC CAGAGGTGGCACT GACT ACAGATGAGAAGTC CAT T TC TGT T GTCC TGACAGCTCC 
AGAGAAGTGGAAGAGAAATCCAGAAGACCTTCCTGTTTCCATGCAACAAATATACTCCAATCTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
TCTGTGAT GGGCTAT TCCATCTACCGATATATCCACGTTGGCAAAGAGAAACACCCAGCAAAT TT 
GAT T T TGAT T T ATGGAAATGAATT TGACAAAAGAT TC T T TG TGC C T GC TGAAAAAATC GT GAT TA 
ACTTTATCACCCTCAATATCTCGGATGATTCTAAAATTTCTCATCAGGATATGAGTTTACTGGGA 
AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 
p* GGAAGAGGAGGTGAAACATT TAGGGTAT GCTTCGCATTT GATGGAAAT TTTTTGTGACTCT GAAG 

If! AAAACACGGAAGGTACTTCTCTCACCCAGCAAGAGTCCCTCAGCAGAACAATACCCCCGGATAAA 
M ACAGTCATTGAATATGAATATGAT GTCAGAACCAC TGACAT TTGTGCGGGGCCTGAAGAGCAGGA 



ni 



GCTCAGTTTGCAGGAGGAGGTGTCCACACAAGGAACATTATTGGAGTCGCAGGCAGCGTTGGCAG 
TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 
CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 
TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
C CAGACAG GC CAC CAGGAGAAAAT GAAACC T AT CTCAT GCAAT T CAT GGAGGAAT GGGGG T TATA 
TGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 
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FIGURE 76 



MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMOQIY 
SNLKYNVSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 

DQ S S E FKAK 1 1 FW YVL P I S I T VFL F S VMG Y S I YR Y I HVGKE KH PAN L I L I Y GNE FDKR F FV P AEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 



GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCAT C C CT GT GCT GGGCAGCCTGGT GAACACCGT CCT GAAGCACAT C 
ATCtGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
TCGTGGAGTTCCACATGACGACTGAGGGCCAAGCCACCATCCGCATGGACACCAGTGCAAGTGGC 



s s 

ff% CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 

U] TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 

W! CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 



CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
T CCT GCCAT CAAGGGT GACACCAT T CAGG T C TACC TGGGGGCCAAGT TGT T GGACTCACAGGGAA 
Q AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
III TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 



;:i T GAT CAATGAAAAGGC TGCAGATAAGCT GGGATC TACCCAGATCGT GAAGATCC TAACTCAGGAC 

ill ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
T T TACACCAAAGGTGACCAACT TATACT CAAC TT GAATAACATCAGCT CT GATCGGATCCAGC TG 
ATGAACTCTGGGATTGGCTGGTTCCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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FIGURE 78 

MAGPWTFTLLCGLIAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGGIPVLGSLWTVLKHIIWLKVITAKILQLQVKPSi^DQELLVKIPLDWAGFNTPLVKTIVE 
F^TTEAQATIR^IDTSASGPTRLVLSDCATSHGSLRIQLLYKLSFLVNALAKQVMNLLVPSLPNL 
VKNQLCPVI EAS FNGMYADLLQLVKVP ISLSI DRLE FDLLYPAI KGDTI QLYLGAKLLDS QGKVT 
KW FNNS AAS LTMPT LDN I P FS L I VS QDWKAAVAAVL S PEE MVLLDS VL PE S AHRLKS S I GL I N 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQLILNLNNISSDRIQLMNSGIGWFQPDVLKNI ITEI IHSILLPNQNGKLRSGVPVSLVKALG 
FEAAESSLTKDALVLTPASLWKPSSPVSQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 7Q 



GAGAGAAGT CAGCCTGGCAGAGAGACT CT GAAATGAGGGAT TAGAGGT GT TCAAGGAGCAAGAGC 
TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGGCATCACCCAGTGTGACATCTATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 



ATTCCTGTTGCGTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
h| CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 
III C T GGAATCAT CCTC TGC T T T TCC TGCT CATCCCAGAGAAAT CGCT CCAAC TAG T ACGATGCC TAC 



CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 

GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 

2. 

fH G GTC T G TGAAAAACAGT GGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCAC TGGATCGT 

Iff GT CAGAAGG T GC TGCT GAGGAT AGACT GACT T TGGCCAT T GGAT T GAGCAAAGGCAGAAATGGGG 

13 GCTAGTGTAACAGCATGCAGGTTGAATTGCCAA.GGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
Iy TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 



GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 
TCCCACTGACTGACCGTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GACTAAT T TG TGCATGAAC TGAAATAAAACCATCC T ACGGT AT CC AGGGAACAGAAAGCAGGAT G 
CAGGATGGGAGGACAGGAAGGCAGCCTGGGACATTTAAAAAAATA 
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FIGURE 80 



MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD 
I YSTLLGLP AD I QAAQAMMVTSS AT SS LAC 1 1 S WGMRC TVFCQESRAKDRVAVAGGVFFI LGGL 
LGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGIILCFSCSSQRNRSNYY 
DAYQAQPLATRSS PRPGQPPKVKSEFNS YSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-2 4 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site, 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 46-59 



APP ID=1 0063532 



Page 232 



FIGURE 81 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
u CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
Q ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
O GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
j!f AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 
|J j ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 
ft J ACT GCCAGAC TC CT GC TGCCAAGCC T GCAAAGATGAGGCAAGTGAGCAAT CGGATGAAGAGGACA 

;s GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
0 AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 

CAGAC CCAAGGGAGCAGGCAGCACAAC TGTCAAGAT CGT CC T GAAGGAGAAACATAAGAAAGCCT 
n\ GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
Q CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
f|| CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 

ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
CTCGGACTTGGTGGAGATCTACCTCTGGAAGCTGGTAAAAGATGAGGAAACTGAGGCTCAGAGAG 
GTGAAGT ACC T GGCCCAAGGCCACACAGCCAGAATC T T CCACTT GACT CAGATCAAGAAAGT CAG 
GAAGCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCCCCACGAAGGTCACT 
GGAACGTCTTCCTAGCCCAGACCCTGGAGCTGAAGGTCACGGCCAGTCCAGACAAAGTGACCAAG 
ACATAACAAAGAC C TAACAGTTGCAGAT AT GAGCT GT ATAAT T GT T GT TAT T ATAT AT T AAT AAA 
TAAGAAGTTGCATTACCCTCAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 



MVPEWVLSSLLGIALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVS C YRLHCPPVHCPQPVTE PQQCCPKCVE PHTPS GLRAPPKS CQHNGTMYQHGE I FSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
I S S TRCPKAPGRVLVHT S VS P S PDNLRRFALEHEAS DLVE I YLWKLVKDEETEAQRGE VPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 




Signal peptide: 



amino acids 1-25 
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FIGURE 8a 



GACAGC TGT GTC TCGATGGAGT AGACTCT C AGAACAGCGC AGT T TG CCCT CCGCT C ACGC AG AGCCT C TC C 

GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 

TCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCCATGGCTCTCATGCTCAGTTTGGTTCTGAGTC 

TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 

GAC GCAGCATTC TC CT GT TT CCT GTCTCCT AAGACC AATGCAGAGGC CAT GGAAGT GC GGTT CT TCAGGGG 

CC AGT T CT CT AGCGTGGT CC ACCT CT AC AG GGACGGGAAGGAC C AGCC AT TT AT GC AG ATGCCAC AGT AT C 

AAGGC AGG ACAAAACTGGTGAAGGAT TCT ATT GCGGAGGGGC GC AT CT CT CT GAGGCTGGAAAACATT ACT 

GT GTT GGATGCT GGCCTCT ATGGGTG CAGGAT T AGT TCCC AGT CTT ACT ACC AGAAGGCC AT CT GGGAGC T 

ACAGGTGTCAGCACTGGGCTCAGTTCCTCTCATTTCCATCACGGGATATGTTGATAGAGACATCCAGCTAC 

TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 

ACAGACTCCAGGACAAACAGAGACATGCATGGCCTGTTTGATGTGGAGATCTCTCTGACCGTCCAAGAGAA 

CGCCGGGAGC AT AT CCTGTTCCAT GC GGCATGCTCATCTGAG CCGAGAGGTGG AAT CC AGGGT AC AGAT AG 

GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 

TTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGACTG 

GAGAAGAAAGCACGGACAGGCAGAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAG 

AGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTG 

CCTCACTCTGAGAAGAGATTTACAAGGAAGAGTGTGGTGGCTTCTCAGAGTTTCCAAGCAGGGAAACATTA 

CT GG GAGGTGGACGGAGGAC AC AAT AAAAGGTGG CGCGT GGGAGTGTG CC GGGATG ATGT GGAC AGGAGG A 

AGGAGTACGTGACTTTGTCTCCCGATCATGGGTACTGGGTCCTCAGACTGAATGGAGAACATTTGTATTTC 

ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 

TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 

AAGGCT TATT GAGGCC CT AC AT TG AGTATC CGT C CT AT AATG AGC AAAAT GGAACT CCCAT AGT CATCTGC 

CCAGTCACCCAGGAATCAGAGAAAGAGGCCTCTTGGCAAAGGGCCTCTGCAATCCCAGAGACAAGCAACAG 

T GAGTCCTCC TCACAGGCAACC ACGCCCTT CCT C CCC AGGGGTGAAAT G TAGGAT GAATCAC AT CCCACAT 

TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 

GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 

CT GAC AT T AC AT TT AGTT T GCT CT C ACT CC AT CT GGCT AAGT G ATCTT GAAAT ACC ACCT CTC AGGT G AAG 
AACCGTCAGGAATTCCCATCTCACAGGCTGTGGTGTAGATTAAGTAGACAAGGAATGTGAATAATGCTTAG 

ATCT T ATT GATGAC AG AGTGT ATCCT AAT GGT T T GT TC AT TAT AT T AC ACT T TC AGT AAAAAAA 



FIGURE 84 



MAMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSVVH 

L YRDGKDQP FMQMPQYQGRTKLVKDS I AEGRI SLRLENI TVLDAGLYGCRI S S QS YYQKAI WELQ 
VSALGSVPLISITGYVDRDIQLLCQSSGWFPRPTAKWKGPQGQDLSTDSRTNRDMHGLFDVEISL 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGYWVLRLNGEHLYFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
P I V I CPVT QE S EKEASWQRAS AI PETSNSESSS QAT TP FL PRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
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FIGURE 8s 



AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGACATGCTGCTGCTGCTGCTGCCCCT 
GCTCTGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACTGCTGACGATGCAGAGTTCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACCAAGAATTGCACCCTGAGCATCAGAGATGCCAGAAGAAGTGATGCGGGGAGA 
TACTTCTTTCGTATGGAGAAAGGAAGTATAAAATGGAATTATAAACATCACCGGCTCTCTGTGAA 
TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 
ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 
GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 
GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 
CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 
AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG 
ATGA GAAACT GCAGAGAC TCACCCT GATT GAGGGATCACAGCCCCTC CAGGCAAGGGAGAAGTCA 
GAGGCTGATTCTTGTAGAATTAACAGCCCTCAACGTGATGAGCTATGATAACACTATGAATTATG 
TGCAGAGTGAAAAGCACACAGGCTTTAGAGTCAAAGTATCTCAAACCTGAATCCACACTGTGCCC 
TCCCTTTTATTTTTT T AAC T AAAAGAC AG AC AAAT T CC T A 
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FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWFREG 
ANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLS I RDARRS DAGRY FFRMEKG S I KWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLSFCVIFVWRSCRKKSARPAAGVGDTGIE 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
YSEIKIHR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 
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FIGURE 87 



AGAAAGCTGCAC TC TGT T GAGC TC CAGGGCGCAGTGGAGGGAGGGAGT GAAGGAGC TC TC TGTAC 
CCAaGGAAAGTGCAGCTGAGACTCAGACAAGATTACAATGAACCAACTCAGCTTCCTGCTGTTTC 
TCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACTTACTTCAAGGAATGGACCTGT 
T C T TCGTC TC CATC TC TGCCCAGAAGC T GC AAGGAAAT CAAAGACGAAT GT CC T AGTGCAT T TGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 

h& GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 

n 

W C AACT AC AACACC T T T GGAT C T G C AG AGG C G GC CAC GAGC GAT GAC T ACAAGAACC C T GGC T AC T 

::: acgacatccaggccaaggacctgggcatctggcacgtgcccaataagtcccccatgcagcactgg 

CP 

1 - s agaaacagctccctgctgaggtaccgcacggacactggcttcctccagacactgggacataatct 
HI gtttggcatctaccagaaatatccagtgaaatatggagaaggaaagtgttggactgacaacggcc 
III cggtgatccctgtggtctatgattttggcgacgcccagaaaacagcatcttattactcaccctat 
^ ggccagcgggaattcactgcgggatttgttcagttcagggtatttaataacgagagagcagccaa 
JL cgccttgtgtgctggaatgagggtcaccggatgtaacactgagcatcactgcattggtggaggag 
gatactttccagaggccagtccccagcagtgtggagatttttctggttttgattggagtggatat 

If 3 

p ggaactcatgttggttacagcagcagccgtgagataactgaggcagctgtgcttctattctatcg 
fl| ttgagagttttgtgggagggaacccagacctctcctcccaaccatgagatcccaaggatggagaa 
caacttacccagtagctagaatgttaatggcagaagagaaaacaataaatcatattgactcaaga 

AAAAAA 
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FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQT FCDMT SGGGGWTLVASVHENDMRGKCTVGDRW S SQQGSKADYPEGDGNWANYNT FGSAEAAT 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWTDNGPVI PWYDFGDAQKTAS YYSPYGQRE FTAGFVQFRVFNNERAANALCAGMRVTGCN 
TEHHCI GGGGYFPEAS PQQCGDFSGFDWS GYGTHVGYS SSREI TEAAVLLFYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site. 

amino acids 163-167 

Glycosaminoglycan attachment sites. 

amino acids 74-78, 289-293 

N-myristoylation sites. 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 



CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 



f"l TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACGCTTCTGATTACCTTCA 
if! TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 



GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
TTAAGACTTATATACAGTT TTAGGGGACAATTAAAAAAAAAAAA 
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FIGURE QO 

MGRVSGLVPSRFLTLLAHLVWITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VEl^GFLSGVSMFNSTQSLISIGAHCSASVALSFFIFERWECTTYWYIFVFCSALPAVTEMALFV 

TVFGLKKKPF 



Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 

u 

Q 
O 

if! 

W 
If! 
W 

ru 

U 

m 
o 
ru 
p 
ru 
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FIGURE Ql 

CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGAraCAACTGAC 

TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGAT.G 

GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 

CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 

GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 

CACCCT CAGCCAAGGTGAAGAAAAT CT T T GGCTGGGGCGACT TCTACT CCAACATCAAGACGGT G 

GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 

CCAACACAATGCCACAGGCCAGGGAAACATCTCCATCAGCCTCGTGCCCCCCAGTAAAGCTGTAG 

AGT T CCACCAGGAACAGCAGAT C TT CAT CGAAGC C AAGGCC TCCAAAATCTT CAAC T GCCGGAT G 

GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 

CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 

TCTACATCGCCTTCTACAGCACGGACTATCGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTAC 

CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGGCAGGCCAGGGC 

TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGGAGGGAAGGGGTTGGGCCTC 

AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 

AGAAAGGGTCCCAAGTGCTGGT CCCAACCTGAAGCTGTGGAGT GACTAGATCACAGGAGCAC TGG 

AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 

TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 

CCT T GGT TCTTGGCATCC TGAGGAAAGATAGCAACAGGGAGGGGGAGATTT CATCAGTGTGGACA 

GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 

GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 

CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 

CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 

GCACACCCATCCTTAAGCTAAGACAGGACGATTGTGGTCCTCCCACACTAAGGCCACAGCCCATC 

CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 

GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 

CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 

TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 

TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE 02 



MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPKPRVPRKRGHISPKSRPMANSTL 
LGL^PGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
SVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEAKASKIFNCRMEWEKVERGRRTSLCTHDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 




N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 



2-oxo acid dehydrogenases acyl transferase 



amino acids 61-71 
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FIGURE Q3 



CGGTGGCCATGACTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCCAAGAA 
ATGT TCCGATTTGCATAT TATAAACTCT TAAAAAAAGCCAGT GAAGGTTTGAAGAGTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCTGAAACTCTGCCTGGTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT 
AACC T CAGGGAACCAGCACT TC CCAAACCGCAGAC TACAT C T T T AGAGGAAGCACAAC TGTGCCT 
TTTTCTGAAAATCCCTTTTTCTGGTGGAATTGAGAAAGAAATAAAACTATGCAGATA 
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FIGURE Q4 



MTAAVFFGCAFIAFGPALALYVFT IAIEPLRI I FL IAGAFFWLVSLLI SSLWFMARVI IDNKDG 
PTQKYLL I FGAFVS VY IQEMFRFAYYKLLKKASEGLKS INPGETAPSMRLLAYVS GLG FGIMS GV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 

Sodium: neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE 95 

AATTTTTCACCAGAGTAAACTTGAGAAACCAACTGGACCTTGAGTATTGTACATTTTGCCTCGTG 
GACCCAAAGGTAGCAATC TGAAACATGAGGAGTACGAT TCTACTGT TTTGTCT TCTAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACACTACCAAACCAACAGCAGTCAAATCAGGTCTTTCCTTCTTTAAGTCTGATACCATTAACA 
CAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGGAATGACACCTGGTAC 
CCAGACCCACCCATTGACCCTGGGAGGGTTGAATGTACAACAGCAACTGCACCCACATGTGTTAC 
id; CAATTTTTGTCACACAACTTGGAGCCCAGGGCACTATCCTAAGCTCAGAGGAATTGCCACAAATC 
Q TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
:f TAAT CCAGATGTCCAGGATGGAAGCCTT CCAGCAGGAGGAGCAGGTGTAAATCC TGCCACCCAGG 

l a l GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 
111 GCAGGCATCCAAAGGAGC ACACAT GCCAT C GAGGAAGC CACC ACAGAAT CAGCAAATGGAAT TC A 

III GTAAGCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
til AT T GAT T ATAT TAT GGAAT AGAT T GAGACAC AT T GGAT AGT CT T AGAAGAAAT TAAT T CT TAAT T 

Ji, TACCTGAAAATATTCTTGAAATTTCAGAAAATATGTTCTATGTAGAGAATCCCAACTTTTAAAAA 

r § 

i ft CAAT AAT T CAATGGATAAATCT GTC T T TGAAAT ATAACAT TAT GCT GCCT GGAT GATATGCATAT 

f"l TAAAACATATT TGGAAAACTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 

fll AAAAAAAAAAAAAAAAAAAAAAAfiiAAAAAAAAA 

CI 



APP_ID= 



10063532 



Page 247 of 320 



FIGURE Q6 

MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQ I FT S L 1 1 HSL FPGG I LP T S QAGANPDVQDGS L PAGGAGVNPAT QGT PAGRLP T P S G 
T DDD FAVT T PAG I QRS THAI EE AT T E SANG I Q 



Signal peptide: 

amino acids 1-16 



s : ; 
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FIGURE Q7 

GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCT GT CT G T GGAAGT TCCAGAAAAC TAT GG T GGAAATT T CCCT T TATACC TGACCAAGT T GC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTAGCAGCTACAGGTCACCCTGGAGATGCAGGATGGACATGTCTTGTGGGGTCCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
GGAT GAGCCAGGCACAGCCAACTCGGAT CT T CGAT T CCACATC CT GAGCCAGGC TCCAGCC CAGC 
!f CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
Q ACCAGCCTTGACCACGCCCTGGAGAGGACCTACCAGCTGTTGGTACAGGTCAAGGACATGGGTGA 
O CCAGGCCTCAGGCCACCAGGCCACTGCCACCGT GGAAGT CTCCATCATAGAGAGCACCTGGGTGT 

in CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 
Zl CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
::f TGCAGAGGGAAACCTCTACGTGACCAGAGAGCTGGACAGAGAAGCCCAGGCTGAGTACCTGCTCC 
Wl AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
III ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
nl CAGT C CACCAGGTAC T GAAGT GACT AGACT GT CAGCAGAGGAT GCAGATGCC CCCGGCT CCCCCA 

ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 

! CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 

Q CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
If! TCGAAGTCGCAGTCACAGATATCAATGAT CACGCCCCTGAGTT CATCACT TCCCAGAT TGGGCC T 

p ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
pa CCTCGAGCCCGCCTTCCGCCTCATGGAT TT TGCCAT TGAGAGGGGAGACACAGAAGGGACT T T T G 

iJ* ■ GCC T GGAT T GGGAGC CAGACTCT GGGCATGTT AGACT CAGACT CT GCAAGAACC T CAG T TAT GAG 

tl' GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
rll AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 

ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
AT T T T CACCCAC T GGACCAT GT CAAGGAAGAAGGACC C GGAT CAACCAGCAGACAGCGT GCCCCT 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCCCCTGGGAGAGAGCCCAGCACCCAAGATCCAGCAGGGGACAGGACAGAGTAGAAGCCCCTCCA 
T CTGC CCT GGGGTGGAGGCACCATCACCAT CAC CAGGCATGT CT GCAGAGCC TGGACACCAAC T T 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAZy^AAAAAW\AAAAAAAAAAAAAAAG 
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FIGURE q8 

MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGMGQIVLSGDSGKAT 
EGPFi\MDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFL FLEAS DRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 

S T S LDHALERT YQLLVQVKDMGDQAS GHQATATVE VS I IES TWVSLE P I HLAENLKVLYPHHMAQ 
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPT SGSVTLGVLPLRAGQNI LLLVLAMDLAGAEGGFS STCEVE VAVT D INDHAPE FI TSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHE WVWQS VAKLVGPGPGPGATATVT VLVERVMP PPKLDQE S YEAS VP I S APAGS FLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REHIIPVWSHNAQMWQLLVRVIVCRCNVEGQCMRKVGRMKGMPTKLSAVGILVGTLVAIGIFLI 
LI FTHWTMSRKKDPDQPADSVPLKATV 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 
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FIGURE qq 



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGTC CAAGATTC TT CCCAGGAACACAAACGTAGGAGACC CAC GC T CCTGGAAGCACCAGCC TT T A 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATCTAGGACCCAGGCATCTTGCTTTCCAGCCACAAAGAGACAGATGAAGATGCAGAAAGGAAATG 
TTCTCCTTATGTTTGGTCTACTATTGCATTTAGAAGCTGCAACAAATTCCAATGAGACTAGCACC 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAGTCACCAACTCTGAGTTCCATACAACCTCCAGTGGGATCAGCACAGCCACCAACTCTGAG 
s . , TTCAGCACAGCGTCCAGTGGGATCAGCATAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGG 
**** GGCCAGCACAGCCACCAACTCTGAGTCCAGCACACCCTCCAGTGGGGCCAGCACAGTCACCAACT 
W CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
p AGTAGGGCCAGCAC TGCCACCAACT CT GAGT CTAGCACACTCTCCAGT GGGGCCAGCACAGCCAC 

51 CAACTCTGACTCCAGCACAACCTCCAGTGGGGCTAGCACAGCCACCAACTCTGAGTCCAGCACAA 
CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 
GCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAG 
Wl AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTGCAGCACGACCTCCAGTGGGGCCA 
||| GCACAGCCACCAACTCTGACTCCAGCACAGTGTCCAG TGGGGCCAGCAC TGCCACCAACTCTGAG 

flj TCCAGCACGACC TCCAGTGGGGCCAGCACAGCCACCAAC TCT GAGT CCAGCACGACC T CCAGT GG 

1* GGCTAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCAGTGGGGCCGGCACAGCCACCAACT 
^ CTGAGTCCAGCACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACACCCTCC 
U AGTGGGGCCAACACAGCCACCAACTCTGAGTCCAGTACGACCTCCAGTGGGGCCAACACAGCCAC 
III CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA 
Q CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA 
f?J GCCACCAACTCTGACTCCAGCACAACCTCCAGTGAGGCCAGCACAGCCACCAACTCTGAGTCTAG 
*J* CACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACAACCTCCAGTGGGGCCA 
Q ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 
f|| ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GTGGAGT CCTAACTGGT TCTGGAGGAGACCAGT ATCATCGATAGCCAT GGAGAT GAGCGGGAGGA 
ACAGCGGGCCCTGAGCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AATCT TGAAGAAGGTAT TCCTCACCTTTCTTGCCTTTACCAGACACT GGAAAGAGAATACTATAT 
TGCTCAT TTAGCTAAGAAATAAATACATC TCATC TAACACACAC GACAAAGAGAAGCTGTGCT T G 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CATCTGGCATTCAAAATCTCCACAGTAAAATCCAAAGACCTCAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 100 



MKMQKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 

GAS TVTNSGS S VT S SGAS TATNSE S STVS SRAS TATNSE S S TLS S GAS TATNS DS S TT S S GAS T A 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 

ST TS SGAS TATNSDS S TVS S GAS TATNSES ST TS S GAS TATNSE S S TT S S GAS TATNSDS S TT S S 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 
AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 

AMEMSGRNSGP 



Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 510-532 
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FIGURE 101 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 
GGATGATGTGATATATGCATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 
AACGAGGCAGT T T CTAGC TAC T GCACGT AC T T CATAAAGCAGGACT CT AAAAGC T T T GGAATC AT 
GGT GT CATGGAAAGGGAT T T ACTT TAT AC T G AC T C T GT T T TGGGG AAGCT T T T T TGGAAGCAT T T 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
GAT TATAACTGGGGATGCAT TT GT TCCT GGAGAAAGAAGT GT CAT T ATCATGAACCAT CGGACAA 
GAATGGAC TGGAT GTTCC TGT GGAAT T GC CT GATGCGATATAGCTAC CT CAGAT TGGAGAAAAT T 
TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
C T T CAT T CATAGGAAAT GGAAGGAT GACAAGAGCCAT T T CGAAGACAT GATT GAT TAC TT T T GTG 
ATATT CACGAACCACT T CAACT CCT CAT AT T C CCAGAAGGGAC T GAT CT CACAGAAAACAGCAAG 
T C T C GAAG TAAT GCAT T T GCT GAAAAAAAT GGAC T TCAGAAAT AT G AAT AT G T T T TACATC CAAG 
AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 
ATATCACTGTGGCGTATCCTCACAACATTCCTCAATCAGAGAAGCACCTCCTCCAAGGAGACTTT 
CCCAGGGAAAT CCAC T T TC ACGT CC AC CGG T AT CCAATAGACACCCT CCC C ACAT CCAAGGAGGA 
CC T T CAACT CTGGT GCCACAAAC GGT GGGAAGAGAAAGAAGAGAGGC TGCGTTCCTTC TAT CAAG 
GGGAGAAGAAT T T T TAT T T TACCGGACAGAG TGT CAT T CCACCT T GCAAGT CT GAAC TCAGGGT C 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATATTTGTACAGTCTTGTTAAGTGGTATTTTATAATCACCATTGTAATCTTTGTGCTGCAAGAGA 
GAATATTTGGTGGACTGGAGATCATAGAACTTGCATGTTACCGACTTTTACACAAACAGCCACAT 
T TAAAT T C AAAGAAAAAT GAGTAAG AT T AT AAGGT T TGCCAT GT GAAAACC TAGAGCAT AT T T T G 
GAAATGTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 
GCCATCAT TAT T TGT T AAAGAT AT T T T GCAC T TAAT T T TGT GGGAAAAATAT TGC TACAAT T T T T 
TTTAATCTCTGAATGTAATTTCGATACTGTGTACATAGCAGGGAGTGATCGGGGTGAAATAACTT 
GGGCCAGAATAT TAT TAAACAATCAT CAGGCT T T TAAA 
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FIGURE 102 



MHSRGREIWLLNPWSINEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWGSFFGSIFMLSP 
FLPLMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVI ITGDAFVPGERSVI IMNHRTRMDWM 
FLWNCLMRYSYLRLEKICLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFT FWDRLREGKNLDAVHDITVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQSVI PPCKSELRVLWKLLS I LYWTLFSPAMCLLI YLY5 LVKWYFI I T I VI FVLQERIFGG 
LEI IELACYRLLHKQPHLNSKKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
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FIGURE 103 

CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
T CCAAAT CAT CCAT CCACCCC TGCT GT CATC TGTT TT CATAGT GT GAGAT CAACCCACAGGAAT A 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACCT CTACAGAGAT GGGGAAGACTGGGAAT CTAAGCAGATGCCACAGT AT CGAGGGAGAAC T GA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
AGGAT T T GTCT TCAGAC TCCAGAGCAAATGCAGATGGG TACAGC CT G TAT GAT GTGGAGAT CT CC 
AT T AT AGTCCAGGAAAAT GC TGGGAGCATAT T GTGT TCC ATCCACCT TGCT GAGCAGAGTCAT GA 
GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
AAAT CCAAAGGGAAAAT CCAGGCGGAAC T GGACTGGAGAAGAAAGCACGGACAGGCAGAAT T GAG 
AGAC GCCCGGAAACACGCAGTGGAGGTGACT C T GGATCCAGAGACGGC T CACCCGAAGC T C T GC G 
TTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTGCCTCACTCTGAGAAGAGA 
T T T ACAAGGAAGAGTGT GGTGGC T T C TCAGGGT TTC CAAGCAGGGAGAC AT TAC T GGGAGGTGGA 
C GT G GG ACAAAAT G TAG GG T GG T AT G T G G GAG T G T G T CGG GAT GAC G T AGAC AGG GG GAAGAACA 
ATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTC 
ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
T GACAT GT CAGT T TGAAGGC T T GT T GAGACCCTATATCCAGCAT GC GATGT AT GACGAGGAAAAG 
GGGACTCCCATATTCATATGTCCAGTGTCCTGGGGATGAGACAGAGAAGACCCTGCTTAAAGGGC 
CCCACACCACAGAC CCAGACACAGCCAAGGGAGAGT GCT CC CGACAGG TG GCCCCAGC T TCC T CT 
CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
TCAGAAGCCATGGCTGCCCTGAAGTGGGGACGGAATAGACTCACATTAGGTTTAGTTTGTGAAAA 
C T CCATCCAGC TAAGCGAT CT T GAACAAGT CACAACCTCCCAGGCT CCTCAT T T GC TAGTCACGG 
ACAGT GAT T CC TGCC TCACAGGT GAAGAT TAAAGAGACAACGAATGT GAAT CAT GCT TGCAGGT T 
T GAGGGCACAGTGT T T GC TAAT GATGT GT T T T TAT AT TATACAT TT T CCCACCATAAACTC TGT T 
T GCT TAT T CCACAT TAAT T TACT T T T C TC TATACCAAATCACCCAT GGAATAGT TATTGAACACC 
TGCTTTGTGAGGCTCAAAGAATAAAGAGGAGGTAGGATTTTTCACTGATTCTATAAGCCCAGCAT 
TACCTGATACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGTCCATATCC 
C T CAT T AAC AC AGACAC AAAAAT TC T AAAT AAAAT T T T AACAAAT T AAAC T AAACAATAT AT T T A 
AAGAT GATATATAAC TAC TCAGT GT GGT T T GTCCCACAAAT GCAGAGT T GGT TTAATAT T TAAAT 
AT CAACCAGTGTAAT TCAGCACAT TAAT AAAGTAAAAAAGAAAAC CAT AAAAAAAAAAAAAAA 



FIGURE 104 

MAFVL I LVLS FYELVSGQWQVTGPGKFVQALVGEDAVFSCSLFPET SAEAMEVRFFRNQFHAWH 
L YRDGE DWESKQMPQYRGRTE FVKDS I AGGRVSLRLKN I TPS D I GLYGCWFS SQ I YDEEATWELR 
VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 
IYQENAGSILCSIHIJ^QSHEVESKVLIGETFFQPSPWRLASILLGLLCGALCGVVMGMIIVFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 

trkswasqgfqagrhywevdvgqnvgwyvgvcrddvdrgknottlsp™gywvlrlttehlyft 
fnphfislppstpptrvgvfldyeggtisffntndqsliytlltcqfegllrpyiqhamydeekg 

tpificpvswg 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 



APPJD=10063532 



Page 256 



FIGURE log 



CI 
ill 



CCTT C AC AGGACTCT T CAT T GCTGGT TGGC A ATGA TGT AT CGGCC AGATGTGGTGAGGGCTAGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CT CAC TGTTCAT TATGTGAGAT AT AATC AAAAGAAG ACCT AC AATT ACT AT AGC ACAT T GTCAT TT AC AAC 
T GACAAACT AT ATGCT GAGT T TGGCAGAGAGGCT T C TAAC AAT TTT ACAGAAATGAGCCAGAGACT T GAAT 
CAATGGTGAAAAATGCATTTTATAAATCTCCATTAAGGGAAGAATTTGTCAAGTCTCAGGTTATCAAGTTC 
AGTCAACAGAAGCATGGAGTGTTGGCTCATATGCTGTTGATTTGTAGATTTCACTCTACTGAGGATCCTGA 
AACTGTAGATAAAATTGTTCAACTTGTTTTACATGAAAAGCTGCAAGATGCTGTAGGACCCCCTAAAGTAG 
AT CCTCACTCAGTTAAAATT AAAAAAAT CAAC AAGACAGAAACAGACAGC TAT CT AAACC ATT GCT GCGG A 
ACACGAAGAAGTAAAACTCTAGGTCAGAGTCTCAGGATCGTTGGTGGGACAGAAGTAGAAGAGGGTGAATG 
GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 
TTGTGAGTGCTGCTCACTGTTTTACAACATATAAGAACCCTGCCAGATGGACTGCTTCCTTTGGAGTAACA 
AT AAAACCT T CG AAAATG AAAC GGGGTCT C CGGAG AAT AATT GT CC AT GAAAAATACAAACACC CATCAC A 
T GACTATGAT ATTT CTCTTGC AGAGC TTTCTAGC CC TGTTCCCTACAC AAAT GC AGTACATAGAGT T T GTC 
TCCCTGATGCATCCTATGAGTTTCAACCAGGTGATGTGATGTTTGTGACAGGATTTGGAGCACTGAAAAAT 
GATGGTT ACAGT CAAAAT C ATC TTCGAC AAGC AC AG GT GACT CT C AT AGACGCT AC AACT T GC AAT GAACC 
TCAAGCTTACAATGACGCCATAACTCCTAGAATGTTATGTGCTGGCTCCTTAGAAGGAAAAACAGATGCAT 
GCCAGGGTGACTCTGGAGGACCACTGGTTAGTTCAGATGCTAGAGATATCTGGTACCTTGCTGGAATAGTG 
AGCT GGGGAG AT GAATGT GC G AAACCCAAC AAGC CTGGTGTTTATACT AGAGT T ACGG CC TTGCGGGACT G 



O 
||| 

iZ GATTACTTCAAAAACTGGTATCTAAGAGACAAAAGCCTCATGGAACAGATAACATTTTTTTTTGTTTTTTG 



GGTGTGGAGGCCATTTTTAGAGATACAGAATTGGAGAAGACTTGCAAAACAGCTAGATTTGACTGATCTCA 
ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GAT C AACTCT GT CATCTGT GAGCAAT AGTT GAAACT TT ATGT AC AT AG AG AAAT AG AT AATACAAT AT T AC 
ATT ACAGC CT GT AT T C ATTT GT TCT CT AGAAGT T TT GTC AGAATTTTG ACTT GTTGAC AT AAATTTGTAAT 
GCATATATACAATTTGAAGCACTCCTTTTCTTCAGTTCCTCAGCTCCTCTCATTTCAGCAAATATCCATTT 
TCAAGGTGCAGAACAAGGAGTGAAAGAAAATATAAGAAGAAAAAAATCCCCTACATTTTATTGGCACAGAA 
AAGT AT T AGGTGTT T TT CTT AGTGGAAT AT T AGAAATG ATCAT ATTC AT T AT GAAAGGTC AAGCAAAG AC A 
GCAGAATACCAATCACTTCATCATTTAGGAAGTATGGGAACTAAGTTAAGGAAGTCCAGAAAGAAGCCAAG 
ATATATCCTTATTTTCATTTCCAAACAACTACTATGATAAATGTGAAGAAGATTCTGTTTTTTTGTGACCT 
ATAATAATTATACAAACTTCATGCAATGTACTTGTTCTAAGCAAATTAAAGCAAATATTTATTTAACATTG 
TT AC TGAGGATGTC AACAT ATAACAAT AAAAT AT AAAT C ACC CA 
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FIGURE 106 

MMYRPDWRARKRVCWEPWVIGLVI FI SLIVLAVC I GLTVHYVRYNQKKT YNYYS TLS FT TDKLY 
AE FGREASNNFTEMSQRLE SMVKNAFYKS PLREE FVKS QVI K FSQQKHGVLAHMLL I CRFHS TED 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTASFGVTIKPSKMKRGL 
RRI I VHEKYKHPSHDYDI SLAELS S PVP YTNAVHRVCLPDAS YE FQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVTL I DAT TCNEPQAYNDAI T PRMLCAGS LEGKTDACQGDS GGPLVS SDARDI WYLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
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FIGURE 1Q7 

AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAATGT TCTCACTTTAACTGCAGAAACGAGGGTAGAGGAAGCAGT CATT TT GACTTACT TTCCT 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 

GT GT AGAAC T GGC T TGTGGCGT T T GGACATATGAACAGGAACT TATGGT T CCAGTACAAT GGT CA 
i j GATATGGTCACTTTGAAAGCCAGGATGACAAATTATGGATTACCTAGATATCGGTGGCTTACTCA 
!!! TGCTTGGAATTTTTTTCAGAGAGAGTTTAAGTGCTGTGGAGTAGTATATTTCACTGACTGGTTGG 
y AAATGACAGAGAT GGACT GGCCCCCAGAT TCCTGC TGT GT T AGAGAATT CCCAGGATGT TCCAAA 

13 CAGGCC CAC CAGGAAGAT CT CAGT GACCT TTAT CAAGAGGGT T GT GGGAAGAAAAT GTAT TCC T T 

CTl T T T GAGAGGAACCAAACAACTGCAGGT GC TGAGGT T TCT GGGAAT C T CCAT T GGGGT GACACAAA 

I a | T C C T GG C CAT GAT TCT C ACCAT T AC TCTGCTCT GGGC TCT GTAT TAT GAT AG AAGGGAGCC T GGG 

ACAGACCAAAT GATGTCC T T GAAGAATGACAAC TCT CAGCACC TGT CAT GT CCCTCAGTAGAACT 
?! GT T GAAACCAAGCC T GT CAAGAAT C T TT GAACACACAT CCAT GGCAAACAGCT T TAATACACACT 

US T T GAGAT GGAGGAGT TATAAAAAGAAAT GTCACAGAAGAAAAC CACAAACT TGT TT TATT GGACT 

ffj T GTGAAT T T T TGAGTACATAC TATGTGT T T CAGAAATAT G TAGAAAT AAAAAT GT TGCCATAAAA 

TAACACC TAAGCATATAC TAT TCTAT GC T T TAAAAT GAGGAT GGAAAAG T T T CATGTCATAAGT C 
*n ACCAC C T GGACAATAAT T GAT GCCCT TAAAATGCT GAAGACAGATG TCAT ACCCACT GTGTAGCC 

hf TGTGTATGACTTTTACTGAACACAGTTATGTTTTGAGGCAGCATGGTTTGATTAGCATTTCCGCA 
Vi T CCATGCAAACGAGTCACAT AT GGT GGGAC T GGAGCCAT AGTAAAGGT T GAT T TAC T T C TACCAA 

Q CT AGT ATATAAAG TACTAATTAAAT GCTAACATAGGAAGT TAGAAAAT ACTAATAAC T T TTAT T A 

f|I CT CAGCGAT CTAT TC T T C T GATGCT AAATAAAT TAT ATAT CAGAAAAC TT T CAATAT TGGT GACT 

IZ ACC TAAAT GT GAT T T T TGCTGGT T ACTAAAATAT TCT TAC CAC T TAAAAGAGCAAGC TAACACAT 

W TGT CT T AAGCT GATCAGGGATT T T T T GT ATATAAGTC TG TG TT AAAT CT G T AT AAT T CAGTC GAT 

III T T CAGT TCT GAT AATGT TAAGAATAAC CAT TAT GAAAAGGAAAAT T T GT C CTGTATAGCATCAT T 

ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 
TGT TATTTAAATACT TAACCACTAATTTTGAAAATTACCAGTGT GATACATAGGAATCATTAT T C 
AGAAT GTAGTCTGGTC TT TAGGAAGTATTAATAAGAAAAT T TGCACATAACTTAGTTGATTCAGA 
AAGGACTTGTATGCTGTTTTTCTCCCAAATGAAGACTCTTTTTGACACTAAACACTTTTTAAAAA 
GCTTATCT TTGCCT TCTCCAAACAAGAAGCAATAGTCTCCAAGTCAATATAAATTC TACAGAAAA 
TAGTGTTCT TT TTCTCCAGAAAAATGCTT GTGAGAATCAT TAAAACATGTGACAAT TTAGAGATT 
C T T T GT T T T AT T T CAC T GAT T AAT AT AC TGT GGC AAAT TACACAGAT TAT TAAAT T T T T T T AC AA 
GAGTAT AGT ATAT T TAT T TGAAATGGGAAAAGTGCAT T TT AC T GTAT T TT GT GTAT T T T GT T TAT 
T TCT CAGAAT AT GGAAAGAAAATTAAAAT GT GT CAAT AAAT AT T T T C TAGAGAGTAA 
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FIGURE 108 



MAREDSWCLRCLLYALNLLFWIM^ 
VMIAVCCFLIIVGMLGYCGTVKRNLLL^ 

LKARMTNYGLPRYRWLTHAWNFFQREFKCCGVVYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ 
EDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWALYYDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 



APP_ID=1 0063 532 



Page 260 



FIGURE 1QQ 

CC AAGGCCAGAGCT GTGGACACCTTAT CCCACTCATCCT C ATC CTCT TCCTCTGAT AAAGCCCCT ACCAGT GCT 
GAT AAAGT CTTTCTC GTGAGAGC CT AGAGGCCTT AAAAAAAAAAGT GCTT GAAAGAGAAGGGGACAAAGGAACA 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CC T GGAG GAAC ACT GACCACCAG TT GGAT GAGT C T C AAGGT CC T CC T CT AT GT GAC AAC C AT GT GAAT GGGG AG 
TGGTACCACTTCACGGGCATGGCGGGAGATGCCATGCCTACCTTCTGCATACCAGAAAACCACTGTGGAACCCA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TCAAT GGGAACT GCT GTCTCT GGAACACCACGGT GGAAGTCAAGGCTTGCCCTGGAGGCT ACT AT GT GTATCGT 
CT GAC C AAGCC CAGC GT C T GCT T C C AC GT CT AC T GT GGT CAT T T T TAT GAC AT C T GC GAC GAGGACT GC CAT GG 
CAGCT GCTCAGAT ACC AGC GAGT GCACATGCGCTC C AGGAACT GT GCT AGGCCCTGACAGGCAGACAT GCTTT G 
AT GAAAAT GAAT GT G AGC AAAAC AAC GGT GGC T GC AGT GAGAT CT GT GT GAACC T C AAAAACT C CT AC C GCT GT 
GAGTGT GGGGTT GGCCGTGTGCT AAGAAGT GAT GGCAAGACTT GT GAAGAC GTT GAAGGAT GCCACAATAAC AA 
TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
CT GAGGAT AACCACACTT GCCAAGTCC CT GT GTT GT GCAAATC AAAT GCCAT T GAAGT GAAC AT CCCC AGGGAG 
CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 
CCT CT TCTCTCTCAAGACAT GT GGT ACAGTGGTCGATGT GGT GAAT GACAAGATT GTGGC CAGC AACCTCGTGA 
CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
ACCTGCGAGTTTCCACGCCTGTACACCATTTCTGAAGGATACGTTCCCAACCTTCGAAACTCCCCACTGGAAAT 
CAT GAGC C GAAATC AT GGGAT CT TC CC AT TC ACTC T GGAGATC T T C AAGGAC AAT GAGT T T GAAGAGC CT T ACC 
GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
TTGGAAAGCTTGGTGGAGAGCT GCTTT GCCACCCCCACCTCCAAGATCGACGAGGTCCT GAAATACTACCTCAT 
CCGGGAT GGCT GT GTTTCAGAT GACTC GGT AAAGC AGT AC AC ATCCC GGGAT C ACCTAGC AAAGC ACT TCCAGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGAC TAG TTCGTAGCCATACCTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
CAC AAAATCAGAAGCT GGGT ATAAT AT TTC AAGTT ACAAACCCT AGAAAAAT TAAACAGT TACT GAAATTAT GA 
CTT AAAT ACCC AATGACTCCTTAAATAT GTAAATT AT AGTTAT ACCT TGAAATTTC AATT CAAAT GCAGACT AA 
TTAT AGGGAAT TT GGAAGT GTAT C AAT AAAACAGT AT AT AATTTT 



FIGURE no 



MPPFLLLTCL FI TGTSVS PVALDPCSAY ISLNE PWRNTDHQLDESQGPPLCDNH VNGEWYHFTGMAGDAMP 
T FC I PENHCGTH APWLNGSHPLEGDGI VQRQACAS FNGNCCLWNTTVEVKACPGG Y YVYRLTKPSVC FHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDWNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CE FPRLYT I SEGYVPNLRNS PLEIMSRNHGI FPFTLEI FKDNE FEEPY REALPTLKLRDSLY FG IE PWH V 
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 

Important features of the protein: 
Signal peptide: 

amino acids 1-16 

N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

Tyrosine kinase phosphorylation sites . 

amino acids 411-418, 443-451 

N-myristoylation sites. 

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxylation site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 



APPJM 0063532 



Page 262 



FIGURE ill 



GAGAGAGGCAGC AGCTTGCT CAGCG G ACAAGG AT GCTGGGCGTGAGGG AC C AAGGCCT GCCC TGCACT CGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CCTGCTGCCTTGGGGTGACAATCTCAGCTCCAGGCTACAGGGAGACCGGGAGGATCACAGAGCCAGCATGT 
T ACAGGAT CC TGAC AGTG AT CAACCT CTGAAC AGCCTCGAT GTC AAACCC CT GCGC AAACCC CGT AT CCCC 
AT GG AGACCTT C AGAAAGGT GGGGAT CCCC AT CAT CAT AG CACT ACT GAG CCTGGCGAGT AT CATC ATTGT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GGAAGCAGCTGTGTGACGGAGAGCTGGACTGTCCCTTGGGGGAGGACGAGGAGCACTGTGTCAAGAGCTTC 
J! CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
S AGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG 
jfl GCT ACAGC AGAGCT GTGGAGAT TGGC CCAGACC AGGAT CT GG AT GT TGT T GAAATC AC AGAAAACAGCCAG 

|y GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
Iff TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
III TC AGCATCCAGT ACGAC AAACAGC AC GT CT GT GGAGGG AGCATC CTGG AC CCCC ACTGGGTCCT CACGGC A 

ft! GCCCACTGCTTCAGGAAACATACCGATGTGTTCAACTGGAAGGTGCGGGCAGGCTCAGACAAACTGGGCAG 
* CTTCCC ATCCCTGGCTGT GGCCAAGATCAT CATCATTGAATT CAACCCCATGTACCCCAAAGACAATGAC A 

W TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
§!f GATGAGGAGCTCACTCCAGCCACCCCACTCTGGATCATTGGATGGGGCTTTACGAAGCAGAATGGAGGGAA 
^1 GAT G T CTG AC AT ACT GCT GC AGGC GT CAGT CC AGGT C ATTGACAGC AC AC GGT GC AAT GCAG AC GATGCGT 

ipi ACC AGGGGGAAGT C AC CGAGAAGATGAT GT GT GCAGGC ATCCCGGAAGGGGGT GTGGACACCT GCC AGGG T 

n'l GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 

CTGCGGGGGCCCGAGCACCCCAGGAGTATACACCAAGGTCTCAGCCTATCTCAACTGGATCTACAATGTCT 
GGAAGGCTGAGCTGTAATGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGATCCCCCAAAGTCAGACACAGAGCAAGAGTCCCCTTGGGTACACCCCTCTGCCCACAGCCTCAGCAT 
TTCTTGGAGCAGCAAAGGGCCTCAATTCCTGTAAGAGACCCTCGCAGCCCAGAGGCGCCCAGAGGAAGTCA 
GCAGCC CTAGCT CGGC CAC ACT TGGTGCTC CC AGCATCCC AGGG AG AG AC AC AGCCCACT GAAC AAGGT CT 
CAGGGGT ATT GCT AAGCC AAGAAGGAAC TT TCCC AC ACT ACT GAAT GGAAGC AGGCT GTCTTGT AAAAGCC 
CAGATCACTGTGGGCTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAA 
GCCTACTAGAGCAAGAAACCAGTTGTAATATAAAATGCACTGCCCTACTGTTGGTATGACTACCGTTACCT 
ACTGTTGTCATTGTTATTACAGCTATGGCCACTATTATTAAAGAGCTGTGTAACATCTCTGGCAAAAAAAA 

AAAA 



APPJD=10063532 



Page 263 of 320 



FIGURE 112 



MLQDPDS DQPLNS LDVKPLRKPRI PME T FRKVG I P 1 1 1 ALLS LAS 1 1 1 VWL I KVTLDKYY FLCG 
QPLHFI PRKQLCDGELDCPLGEDEEHCVKS FPEGPAVAVRL S KDRS TLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVEIGPDQDLDWEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KTPRWGGEEASVDSWPWQVSIQYDKQHVCGGSILDPHWLTAAHCFRKHTDVFNWKVRAGSDKL 
GSFPSLAVAKIIIIEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWG 
FTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQS 




DQWHWGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKAEL 



Transmembrane domain: 



amino acids 32-53 (typell) 
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FIGURE 11a 



GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
fa GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
O CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
|3 AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
I GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
ll\ ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
|y CAGAAGGGGT C TGC T TAGACCACCT G GT T TAT G T GACAGGACT T GCAT TCT CC T GGAACATGAGG 

RJ GAACGCCGGAGGAAAGCAAAGT GGCAGGGAAGGAACT TGTGCCAAAT TATGGGTCAGAAAAGATG 

5 GAGGT GT T GGGT TAT CACAAGGCAT CGAG TCT CC TGCATTCAGTGGACAT G T GGGGGAAGGGCT G 

^ CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
P CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
fff TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
O CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 
^ GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 

GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
CTCAATTTAAATCATGTTCTAGTAATTGGAGCTGTCCCCAAGACCAAAGGAGCTAGAGCTTGGTT 
CAAATGATCT CCAAGGGCCC TT ATACCCCAGGAGACTT TGAT TTGAATT TGAAACCCCAAATCCA 
AACCT AAGAACCAGGT GCATTAAGAAT CAG T TAT TGC CGGGTGT GGT GGCCT GTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAA?^AAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGCTACT CGGGAGGC TGAGACAGGAGAAT TAC T TGAACC TGGGAGGT GAAGGAGGCTGAGACA 
GGAGAATCACTTCAGCCTGAGCAACACAGCGAGACTCTGTCTCAGAAAAAATAAAAAAAGAATTA 

TGGTTATTTGTAA 
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FIGURE 114 

MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 



Signal peptide: 

amino acids 1-15 
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FIGURE 115 

CAGCAGTGGTCTCTCAGTCCTCTCAAAGCAAGGAAAGAGTACTGTGTGCTGAGAGACCATGGCAA 
AGAATCCTCCAGAGAATTGTGAAGACTGTCACATTCTAAATGCAGAAGCTTTTAAATCCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACATGGAGCACACTT 
TCTACAGCAATGGAGAGAAGAAGAAGATT TACATGGAAATTGATCCT GT GACCAGAACTGAAATA 
T T CAGAAGCGGAAATGGCACT GAT GAAACAT TGGAAGTGCACGACT T T AAAAACGGAT ACAC TGG 
CAT C TAC T T C GTGGGTC T TCAAAAAT GT T T TATCAAAAC T CAGAT T AAAGTGAT TCC TGAAT T T T 
C T G AAC C AG AAG AGG AAAT AG AT G AG AAT G AAG AAAT T AC C AC AAC T T T C T T T GAAC AG T C AG T G 
AT T T GGGTCC CAGCAGAAAAGCCTAT T GAAAACCGAGAT T TT C T TAAAAAT TCCAAAAT TC TGGA 
GAT T TGT GAT AACGTGACCATGTAT T GGAT CAATCCCAC T CTAAT AT CAG T TT C TGAGT TACAAG 
ACTT TGAGGAGGAGGGAGAAGATCT TCACTT TCCTGCCAACGAAAAAAAAGGGATTGAACAAAAT 
GAACAGT GGGT GGT CCC TCAAGT GAAAGT AGAGAAGACC CGTCACG C CAGACAAGCAAGT GAGGA 
AGAAC T T CCAATAAATGAC TAT AC T GAAAAT GGAATAGAAT T T GAT CC CAT GC TGGAT GAGAGAG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTC TAA TAGGAGGTTTGAGCTCAAATGCTTAAAC 
TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 
GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
T TAT CACCAAAAAAAAAAAAAAAAAAA * 
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FIGURE 116 



MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYDME 
HT FYSNGEKKK I YME I DP VTRTE I FRS GNGT DE TLEVHDFKNGYTGI Y FYGLQKC F IKTQ I KVI P 
EFSEPEEE I DENEE I TTTFFEQSVI WVPAEKP IENRDFLKNSKI LE I CDNVTMYWINPTL I S VSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWVVPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCC I YCRRGNR YCRRVCE PLLG YYP YPYC YQGGRVI CRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-4 0 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites . 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 

GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCT GC AT CG CG GCCACCGGG ATGG AC ATGT GGAGC ACCCAGGAC C TGT ACGAC AACCCCGTCACCT 
CCGTGTTCCAGTACGAAGGGCTCTGGAGGAGCTGCGTGAGGCAGAGTTCAGGCTTCACCGAATGCAGGCCC 
T ATT TC AC CATCCT GGGACT TCC AGCC AT GCT GCAGGCAGTGCG AGCCCT GATGAT CGT AGGCATCGT CC T 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG 
| 4 TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
O TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAA 
If! GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
IM TGGGTCCAACACCAAAAACAAGAAGATATACGATGGAGGTGCCCGCACAGAGGACGAGGTACAATCTTATC 
Ui CT T CCAAGCACGAC TATGTGTAATGC T CT AAG ACC T CT C AGC AC GGGCGG AAGAAACT C C CGGAGAGCTC A 

Z?[ CCC AAAAAACAAGGAG AT CC CATC T AGAT T TCT TCT TGCT T T T G ACTC AC AG C T GGAAGT TAGAAAAGCC T 

5W CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 

S 

m GCTGAGTTATTTATGAATTAGAGGCTATAGCTCACATTTTCAATCCTCTATTTCTTTTTTTAAATATAACT 
i'S TTCTACTCTGATGAGAGAATGTGGTTTTAATCTCTCTCTCACATTTTGATGATTTAGACAGACTCCCCCTC 
f;;| TTCCTCCTAGTCAATAAACCCATTGATGATCTATTTCCCAGCTTATCCCCAAGAAAACTTTTGAAAGGAAA 
H| GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 
CI ACACTTACTGAAGAAGAAGCAATAAGAGAAAGATATTTGTAATCTCTCCAGCCCATGATCTCGGTTTTCTT 
I'll AC ACTGTG AT CT TAAAAGT T ACCAAACC AAAG TCAT TT T CAGTT T GAGGC AACC AAACCT TT CT ACT GCT G 

T TGACATCTT CT TATT ACAGCAAC AC CAT T CT AG GAGT TTCCTGAGCT CT CC ACTGGAGT CCTCTTTCTGT 
CGCGGGTCAGAAAT TGTCCC T AGAT GAAT GAG AAAATT AT TTTT TTT AAT TT AAGT CCT AAAT ATAGT T AA 
AAT AAATAAT GT T T T AGT AAAAT GAT AC ACTATCTCTGT GAAAT AGCC TC ACCCCT ACAT GT GGAT AGAAG 
GAAATGAAAAAAT AAT TGCTTT GAC AT T GT CT AT AT GGT ACT TT GT AAAG TCAT GCT T AAGT AC AAAT TCC 
ATGAAAAGCT CACACCTGT AAT CCT AGC AC TTTGGG AGGC TG AGGAGGAAGG AT CACTT GAGCCC AGAAGT 
T CGAGACT AGCCTGGGC AAC AT GGAGAAGC C CT GTCT CT ACAAAAT ACAGAG AGAAAAAATC AGCCAGTC A 
T GGT GGC AT ACACCTGT AGTCC CAGC AT TC CGGG AGGC TGAGGT GG GAGGAT CACT T GAGCCC AGGGAGGT 
TGGG GCTGCAGT GAGCC ATGAT CACACC ACTGCACT CC AGCC AGGT GACATAGCGAGATCCT GTCT AAAAA 
AATAAAAAATAAATAATGGAACACAGCAAGTCCTAGGAAGTAGGTTAAAACTAATTCTTTAA 
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FIGURE 118 



MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFTILGLPAMLQAVRALMIVGIVLGAIGLLV^ 

CAI AGVSVFANMLVTNFWMS TANMYTGMGGMVQT VQTRYT FGAAL FVGWVAGGLTLIGGVMMCIA 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 

V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
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FIGURE HQ 



GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 

AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 

GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 

GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 

CAT CAGGAT GCAGTGCAAAATCTAT GAT T CCCTGCTGGCTCTTTCTCC GGACCTACAGGCAGCCAGAG 

GACTGATGTGTGCTGCTT.CCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 

ACCAGGT GCACGGGGGACAATGAGAAGGTGAAGGCT CACAT T CTGCT GACGGCTGGAAT CATCT T CAT 

CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 

ACTCAATAGTGAATGTTGCCCAAAAACGTGAGCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCA 

CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 

CAGATACTCGATACCTTCCCATCGCACAACCCAAAAAAGTTATCACACCGGAAAGAAGTCACCGAGCG 

TCTACTCCAGAAGTCAGTATGTGTAGTTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 

ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 

AAT CTTAATTACAGGAACT GTGCATCAGCTAT T TAT GAT TCTATAAGCTATT T CAGCAGAATGAGATA 

TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 

CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 

ACGACATAGCATTAT GTACATAGATGAGT GTAACATT TATATCT CAC ATAGAGACATGCT TATAT GGT 

TTT ATTT AAAAT GAAAT GCCAGT C CAT TACACT GAAT AAAT AGAACTCAAC T ATTGCTT T TCAGGGAA 

ATCATGGATAGGGTTGAAGAAGGTTACTATTAATTGTTTAAAAACAGCTTAGGGATTAATGTCCTCCA 

T TT AT AATGAAGATT AAAAT GAAGGCT T TAAT CAGC ATT GTAAAGGAAATT GAAT GGCTT T CT GAT AT 

GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 

CTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCAAACTGCTT 

TTCCAGGGCTATACTCAGAAGAAAGATAAAAGTGTGATCTAAGAAAAAGTGATGGTTTTAGGAAAGTG 

AAAAT AT T TTT GTTT TT GTATTT GAAGAAGAAT GAT GCATT TTGACAAGAAAT CAT AT AT GTATGGAT 

ATATT TTAATAAGTATTTGAGTACAGACTTTGAGGTTTCAT CAATATAAATAAAAGAGCAGAAAAATA 

TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 

GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 

CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 

ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 

TAT ACAT TTATATTAAT AAAT TGTACATTTT TCTAAT T 



10063532 



Page 271 



FIGURE 120 



MATHALE I AGLFLGGVGMVGTVAVTVMPQWRVSAFI ENNI WFENFWEGLWMNCVRQAN I RMQCK 
IYDSLIALSPDLQAARGLMCAASVMSFI^^ 

MWLI PVSWVANAI IRDFYNS IVNVAQKRELGEALYLGWTTALVLIVGGALFCCVFCCNEKSSSY 
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYV 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 82-101, 118-145, 164-188 
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FIGURE 121 



GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCGAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GATTCAAAGGAGAAAAGGGGGAATGTCTGAGGGAAAGCTTTGAGGAGTCCTGGACACCCAACTAC 
AAGCAGT GTT CATGGAGT TCAT TGAAT TAT GGCATAGAT C TT GGGAAAAT T GCGGAGT GTACAT T 
TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 
GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
ATTGAAGCTATAATTTATTTGGACCAAGGAAGCCCTGAAATGAATTCAACAATTAATATTCATCG 
CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATC 
ATTATTGAAGAACTACCAAAATAAATGCTTTAATTTTCATTTGCTACCTCTTTTTTTATTATGCC 
T T GGAAT GGT T CACT TAAATGACAT T T TAAATAAGT T TAT GTATACAT CT GAAT GAAAAGCAAAG 
CTAAATATGTT TACAGACCAAAGT GT GAT TTCACACTGT TT TT AAAT CTAGCAT TATTCAT TT T G 
CTTCAATCAAAAGTGGTTTCAATATTTTTTTTAGTTGGTTAGAATACTTTCTTCATAGTCACATT 
CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
AAAAAAT AT AAAAGC T AC C AAT C T T T G T AC AAT T T G T AAAT G T T AAG AAT T T T T T T TAT AT C T G T 
TAAATAAAAAT TAT T T CCAACA 



FIGURE 122 



MRPQGPAASPQRLRGLLLLLLLQLPAPSSASE I PKGKQKAQLRQRE WDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRIIIEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 12 ft 



y ^ 



™J jj 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
AT T GCC TGGAAGAATACATCAT GT T T T TC GATAAGAAGAAATT GT AGGAT CCAG T T T TT T T T TTA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCC CTAAGCCAAAGCAAAAGACCTAAGGAC GACC T T T GAACAATACAAAGGATGGGT TT CAAT G 
TAATTAGGCTACTGAGCGGATCAGCTGTAGCACTGGTTATAGCCCCCACTGTCTTACTGACAATG 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
AT AACAGCCTT CAAAAAC T TAAGTAT AAT CAAT T TAAAGGGC T CAACCAGC T CACC TGGC TATAC 
CTT GACCATAACCATATCAGCAATATTGACGAAAATGCT TTTAATGGAATACGCAGAC TCAAAGA 
GCTGATTCTTAGTTCCAATAGAATCTCCTATTTTCTTAACAATACCTTCAGACCTGTGACAAATT 
TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
C GGAAGC T GCTGAGT T TACAT T T ACGGTC TAAC TCC C TGAGAACCAT CCC T GT GCGAATAT T C CA 
AGACTGCCGCAACCTGGAACTTTTGGACCTGGGATATAACCGGATCCGAAGTTTAGCCAGGAATG 
TCTTTGCTGGCATGATCAGACTCAAAGAACTTCACCTGGAGCACAATCAATTTTCCAAGCTCAAC 
CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 



m 

ji s 5 
111 

w 

Rl CATAGGACAGACCATGTCCTGGACCTGGAGCTCCT TACAAAGGCTTGAT T TAT CAGGCAAT GAGA 

* TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
PI T C CAAC AAGC T C AC AT T TAT T G G T C AAGAGAT T T T GGAT T C T T G GAT AT C C C T CAAT GAC AT C AG 

TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
T T AAAGGT CT AAGGGAGAATACAAT TAT C T GTGCCAGTCC CAAAGAGC T GCAAGGAG TAAATGT G 
W AT.C GAT G C AG T GAAGAAC T AC AG CAT C T G T G GC AAAAG T AC T AC AG AG AG G T T T GAT C T GGC C AG 

ft! GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
fl TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 

TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAGTCCCTAAAGCAAATGACTCCCAGCACCCAGGAAT T TTATGTA 
GAT TAT AAACCCAC CAACACGGAGACCAGCGAGATGC TGC T GAATGGGACGGGACCC T GCACC T A 
TAACAAAT CGGGCT CCAGGGAGT GT GAGGT ATGAACCAT T GTGAT AAAZ^AGAGCT C T T AAAAGC T 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
G G T CAT TTTCCTCT C AT ACAT AAT CAAC C CAT T GAAAT T T AAAT AC CACAAT CAAT GT GAAGC T T 
GAAC T CCGGT T TAATATAATACCTAT T GTAT AAGACCCT T TACT GAT T CCAT TAATGTCGCAT T T 
GT T T TAAGATAAAACT TC T T TCAT AGGTAAAAAAAAAAA 
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FIGURE 124 

MGFNVTRLLS GSAVALVIAPTVLLTMLS SAERGCPKGCRCEGKM^/TCE SQKLQE I P S S I S AGCLG 

LSLRYNSLQKLKYNQFKGLNQLTWLYLDHNHISNIDENAFNGIRRLKELILSSNRISYFLNNTFR 

PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 

LARNVFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVIGQTMSWTWSS 

SGNE IEAFSGPSVFQCVPNLQRLNLDSNKLT FI GQE I LDSWI SLNDI SLAGNI WECSRNICSLVN 

WLKS FKGLRENT 1 1 CAS PKELQGWVI DAVKNYS I CGKS T TERFDLARALPKPT FKPKLPRPKHE 

SKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 

SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites . 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 125 

CCGTTATCGTCTTGCGCTACTGCTGAATGTCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
AT TGAAGAGGAAGGCTT TCTAAAGCTTTGGCAAGGAGTGACACCCGCCAT TT ACAGACACGTAGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
AT GAGCAT TAT CC CCT T T GGAAAT CAGT CATT GGAGGGAT GAT GGCT GGT G T T ATT GGCCAGT T T 
TTAGCCAATCCAACTGACCTAGTGAAGGTTCAGATGCAAATGGAAGGAAAZ^AGGAAACTGGAAGG 

AAAACCAT T GC GAT T T CGTGGT GT ACAT C AT GCAT T T GCAAAAAT C T T AGC T GAAGGAGGAATAC 
GAGGGCTTTGGGCAGGCTGGGTACCCAATATACAAAGAGCAGCACTGGTGAATATGGGAGATTTA 
AC C ACT TAT G ATACAGT GAAAC AC TACT T GGT AT TGAATACACCACT T GAGGAC AAT AT CAT GAC 
TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
T CAAAAGCAGAATAAT GAAT C AACCACGAGAT AAACAAGGAAGGGGAC T T T T GT ATAAAT C AT CG 
ACTGACTGCTTGATTCAGGCTGTTCAAGGTGAAGGATTCATGAGTCTATATAAAGGCTTTTTACC 
ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
TGAGTGGAGTCAGTCCATTT TAA. 
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FIGURE 126 



MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALG 1 1 EEEGFLKLWQGVT PAI YRHWYS GGRMVT YEHLRE WFGKS E DEH YPL WKS 
VI GGMMAGVI GQFLANPTDLVKVQMQMEGKRKLEGKPLRFRGVHHAFAKI LAEGG I RGLWAGWVP 
NIQRAALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 
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FIGURE 127 

CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 

CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 

AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 

CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 

GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 

CAAATGAAGAGGACATCCTCTCCATGTTCTCAGCTATCCGTTCTCAGCACAGCGGTGTAGACATC 

TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 

GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 

AGGAGCGGAATGTGGACGATGGGCACATCATTAACATCAATAGCATGTCTGGCCACCGAGTGTTA 

CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 

GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 

AGACACAAT TC GC C T TC AAAC T CCACGACAAGGACC C T GAGAAG GCAGC T G CCACC TATGAGCAA 

ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 

CAT CCAGATT GGAGACAT CCAGAT GAGGC CCACGGAGCAGGTGACC TAGTGAC T GTGGGAGC TC C 

TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 

CACGGGATACCACTTCC TGTCCACACCCCGACCAGGGGCTAGAAAAT T TGTT TGAGAT TT TTATA 

TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 

TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 

TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 

TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 

AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 

AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 

CAGAACACCAGGGCC TGGCCCAGTGGATT TCATGGTGATCATTAAAAAAGAAAAATCGCAACCAA 

AMAAAAAAA 
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FIGURE 128 



MARPC^ERWRDRLALVTGASGGIG 

PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGLARPDTLLSGSTSGWKDMFNVNVLALSICTR 
EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
ISPGWETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoYlation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
199-205 

Short-chain alcohol dehyrogenase . 

amino acids 30-42, 104-114 
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FIGURE 12Q 



AACTTCTACATGGGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTGAAATGAGGAACTGAGAAAATTAATTTCTCATGTATTTTTCTCATTTATTTA 
TTAAT TT T TAACTGATAGTTGTACATATT TGGGGGTACATGTGAT AT TT GGATACATGTATACAA 
T AT AT AATGATCAAATCAGGGTAACTGGGAT AT C CAT CAC AT CAAACAT T TAT T T T T TAT T CT TT 
TTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGCAAC 
CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 

AACCT CTGAT CACCTCAT TCT ACT C T C T AC C T CC AT GAGAT C CAC T T T T T T AGC TCCC ACAT GT G 
AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 
T TCCATCCATGT TGCTGCAAATGACAGGAT T TCGT TCT T AATT TCAAT TAAAATAACCACACATG 
GCAAAAA 



FIGURE ISO 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGESPMPFKRVFCQDGNVRSFCVCAVHFSSHQPPVAVECLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 86-92 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 
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FIGURE lai 



T T CT G AAG T AACGG AAGCT ACCTTGT AT AAAGACCTCAAC ACTGCT GACC ATGAT C AGCGC AGC CT GG AGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAATACCAGAGGATGCTACAACTCTCTACCTTCAGAACAACCAAATAAATAATGCTGGGATTCCTTCAGAT 
TTGAAAAACTTGCTGAAAGTAGAAAGAATATACCTATACCACAACAGTTTAGATGAATTTCCTACCAACCT 
CCCAAAGTATGTAAAAGAGTTACATTTGCAAGAAAATAACATAAGGACTATCACTTATGATTCACTTTCAA 
AAAT T C CCT ATCTG G AAG AAT T AC AT TT AG AT GACAACTC T GTCTCTGCAGT T AGC AT AGAAGAGGGAGC A 
y, TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
H GCCCAGGACTATAGAAGAACTACGCTTGGATGATAATCGCATATCCACTATTTCATCACCATCTCTTCAAG 
Q GT CTCACT AGTCTAAAACGCCTGGTTCT AGATGGAAACCTGTTGAACAAT CATGGTTT AGGT GACAAAGTT 

01 TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 
W TCCAGGCACAAACCTGAGGAAGCTTTATCTTCAAGATAACCACATCAATCGGGTGCCCCCAAATGCTTTTT 
W| CTTATCTAAGGCAGCTCTATCGACTGGATATGTCCAATAATAACCTAAGTAATTTACCTCAGGGTATCTTT 
J? GATGAT TTGG AC AAT AT AAC AC AACT GAT T CT TCGC AACAAT CC CT GGT ATT GC GGGT GC AAGATGAAAT G 

GGTACGTGACTGGTTACAATCACTACCTGTGAAGGTCAACGTGCGTGGGCTCATGTGCCAAGCCCCAGAAA 
pi AGGTTCGTGGGATGGCTATTAAGGATCTCAATGCAGAACTGTTTGATTGTAAGGACAGTGGGATTGTAAGC 
|f| ACCATTCAGATAACCACTGCAATACCCAACACAGTGTATCCTGCCCAAGGACAGTGGCCAGCTCCAGTGAC 
p CAAACAGC C AGAT AT T AAGAAC CCC AAGC T CACT AAGG AT CAAC AAAC CACAGGGAGT CC CT C AAG AAAAA 

f|| CAATTACAATTACTGTGAAGTCTGTCACCTCTGATACCATTCATATCTCTTGGAAACTTGCTCTACCTATG 
O ACTGCTTTGAGACTCAGCTGGCTTAAACTGGGCCATAGCCCGGCATTTGGATCTATAACAGAAACAATTGT 
fll AAC AGGGGAACGCAGTGAGTACTT GGTCAC AGCC CT GGAGCCTGATTCACCCT ATAAAGT AT GC AT GGTTC 

CCATGGAAACCAGCAACCTCTACCTATTTGATGAAACTCCTGTTTGTATTGAGACTGAAACTGCACCCCTT 
CGAATGTACAACCCTACAACCACCCTCAATCGAGAGCAAGAGAAAGAACCTTACAAAAACCCCAATTTACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
T TC AT AGG AATGGATC GC TCTT CT CAAGGAACT GTGCAT AT AGC AAAGGG AG GAGAAGAAAGGATGACT AT 
GCAGAAGCTGGCACTAAGAAGGACAACTCTATCCTGGAAATCAGGGAAACTTCTTTTCAGATGTTACCAAT 
AAGC AATGAACC CAT CTCG AAGGAGGAGTT TGT AAT AC AC ACCAT ATT TCCTCC TAATGG AAT GAATCTGT 
ACAAAAACAATCACAGTGAAAGCAGTAGTAACCGAAGCTACAGAGACAGTGGTATTCCAGACTCAGATCAC 
TC AC AC TC ATGATGCT GAAGGACT C ACAGC AG AC TT GT GT TT T GGGTT TT TT AAAC CT AAGG G AGGTGATG 
GT 
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FIGURE 132 



MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 

EELHLDDNSVSAVS I EEGAFRDSNYLRLL FL S RNHLS T I PWGLPRT IEELRLDDNRI ST I S S PSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV 
KVNVRGLMCQAPEKYRGMAIKDLNAELFDCKDSGIVSTIQITTAIPNTVYPAQGQWPAPVTKQPD 

IKNPKLTKDQQTTGS PSRKT I T I TVKSVTSDT IHI SWKLALPMTALRLSWLKLGHS PAFGS I TET 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
PYKNPNLPLAAI I GGAVALVT I ALLALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNS 
I LE IRETS FQMLP I SNE P I SKEE FVT HT I FPPNGMNLYKNNHSE SS SNRS YRDSGI PDSDHSHS 

Important features of the protein: 
Signal peptide: 

amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Ami da ti on site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE 133 

CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGGTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
S CTCCACTACCTCAAGCTGAGTGATCCCAAGTACCTAAGAGAGTTCCAGCTGACCCTCCAGCCCGG 
g| GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
ll| CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
GACAGCAGCGAGCC CTGCGGCCTCT CAGACCT C TGCAGGAGCCT CATGAC CAAGCCCGGCT GC T C 

3 .. » 

™ AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
'„ GACCACT CCAACAGAGCCAGGACTATAT CAACCTCT TCTGCGCCAACATGATGGACT TGAACCGC 

Q AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
Iff TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
C'l AACAGCAGGAAGGATGCT TCGGGGAGCCT GATGCTGAAGATGAAGAATTATCTAAAGC TATTCAA 

5if TATCAGCAGCAT TT T T CGAGGAGAGTGAAGAGGCGAGAAAAACAATTTCCAGATTCTCGCTCTGT 

pi TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 

AATT C T C T T GCC TCATCC T CCC GAG TAGC T GGSAC TACAGGAGCGTGC CACCATACC TGGC TAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTAT^GTGTGAGCCACCGTGTCTG 
GCTGAAAAGCACTTTCAAAGAGACTGTGTTGAATAAAGGGCCAAGGTTCTTGCCACCCAGCACTC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
CCACCACCAAGCAGCCGCTGAGACGGACGGTTCCATGCCAGCTGCCTGGAGGAGGAACAGACCCC 
T T T AGTCC T CAT CCC TTAGATC CT GGAGGGCACGGAT CACATCCT GGGAAGAAGGCATCTGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA 
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FIGURE 134. 

MSARGRWEGGGRRACRGSLGLARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
TAESKATIADLILSALERATVFLEQRLPEINLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCANMMDLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 

p Important features of the protein: 

U- Signal peptide: 

:ll amino acids 1-26 

jffj Transmembrane domain: 

9 amino acids 39-56 

Si Tyrosine kinase phosphorylation sites. 

fl'l amino acids 149-156, 274-282 

CI 

i%| N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1. 

amino acids 230-237 
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FIGURE ias 

GGTCTGAGTGCAGAGCTGCTGTCATGGCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGCTTACAGATTTGATCCCGTTCGAGTGGATATCACTTCGAAAGGAAAAATGAGAGCAAGATATG 
T GAAT TACAT CAAAACATCAGAGGT T GT CAGAC TGC CC TATC C T C T CCAAAT GAAAT CTT CAGGT 
CCACCTTCTTACTTTATTAAAAGGGAATCGTGGGGCTGGACAGACTTTCTAATGAACCCAATGGT 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
CTGACATGAGACGGGAAATGGAGCAGTCAATGAATATGCTGAATT CCAACCATGAGT TGCC TGAT 
GTTTCTGAGTTCATGACAAGACTCTTCTCTTCAAAATCATCTGGCAAATCTAGCAGCGGCAGCAG 
T AAAACAGGCAAAAGTGGGGC T GGCAAAAGGAGGTAGT CAGGC CGT C CAGAGC T GGCATT TGCAC 
AAACACGGCAACACTGGGTGGCATCCAAGTCTTGGAAAACCGTGTGAAGCAACTACTATAAACTT 
GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 
GAGG T CAATAT TGATGT CAC TGAAT T AAT TACAGTGTCCTATAGAAAATGCCAT TAATAAATTAT 
AT GAACT ACT ATACAT TAT GT AT AT TAAT TAAAACATCT T AATCCAGAAATCAAAAAAAAAAAAA 
AAAAAAAAAAAAAAA 



FIGURE 1*6 

MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVDGEEHVGFLKTDGS FWHDI PSGSYWEWS PAYRFDPVRVDI TSKGKMRARYVNYIKTSE 
WRLPYPLQMKSSGPPSYFIKRESWGWTDFLMNPM^^ 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 

Important features of the protein: 
Signal sequence: 

amino acids 1-23 

Transmembrane domain: 

amino acids 161-182 

N-glycosylation site. 

amino acids 184-187 

Glycosaminoglycan attachment sites. 

amino acids 37-40, 236-239 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 137 

GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACC TGCT GGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGT GC 
CACCAAC T CGCAC T CAGAC T C T GAAC TCAGACC TGAAATC T T C T CT T CACGGGAGGC T TGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACAGTCAATTTGGG 
AAGC T GT GTGAT CGCCACAAACCT T CAGGAAATACGAAAT GGAT T T T CTGAGATACGGGGCAGT G 
T GCAAGC CAAAGATGGAAACAT TGACAT CAGAAT C T TAAGGAGGACT GAGT CT T T GCAAGACACA 
AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 
C CAT CAAGAAGGACC TCCGGCT CT CT CAT GCCCACATGACATGCCAT TGTGGGGAGGAAGCAAT G 
AAGAAATACAGCCAGAT TCTGAGT CACT TTGAAAAGC TGGAACCTCAGGCAGCAGTTGTGAAGGC 
TTTGGGGGAACTAGACATTCTTCTGCAATGGATGGAGGAGACAGT^ATAGGAGGAAAGTGATGCTG 

C T GCT AAGAAT AT TC GAGGTCAAGAGCT C CAG T C T T CAAT ACCT GCAGAGGAGGCAT GACCCCAA 
ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
CTTCCTTGCATGATTGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTT 
T G T AAT AT CTTTCTGC TAT T GGAT AT AT T T AT T AGT T AATATAT T TAT T TAT T T T T T G C TAT T T A 
AT GT AT T TAT T T T T T TAC T TGGACAT GAAAC T T TAAAAAAAT T C ACAGAT T ATATT TATAAC C T G 
ACTAGAGCAGGTGATGTATTTTTATACAGTAAAAAAAAAAAACCTTGTTU^TTCTAGAAGAGTGG 
CTAGGGGGGTTATTCATTTGTATTCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGAT 
AT TT G AAAT T GAAC CAAT GAC T ACT T AGGATGGGT T G T GGAAT AAGT T T TGATG T GGAAT T GCAC 
ATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAG 
CCAGGAATCCTACACGGCCAGCATGTATTTCTACAAATAAAGTTTTCTTTGCATACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 138 



MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 

SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 

FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 

SLANSFLTIKKDLRLSHAHMTCHCGEEAMK 

E 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N-myristoylation sites. 

amino acids 42-47, 46-51, 136-141 
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FIGURE 13Q 

CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACC T CT C T TCAGTAAAGT T GT TAT TGTT C T GATAGAT GCCT T GAGAGATGAT TTTGTGTTTG 
GGTCAAAGGGTGTGAAATTTATGCCCTACACAACTTACCTTGTGGAAAAAGGAGCATCTCACAGT 
TT T GTGGCT GAAGCAAAGCCACCTACAGT T AC TATGCC TCGAAT CAAGGCAT T GAT GACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
T AAG AC AAG C AAAAG C AGC T G GAAAAAGAAT AG T C T T T TAT GGAGAT GAAACC T GGG T T AAAT T A 
T TCCCAAAGCAT T T TGTGGAAT AT GATGGAACAACCTCAT T T T TCGT GT CAGAT T ACACAGAGGT 
y h GGAT AATAAT G TCACGAGGCAT T T GGATAAAGTAT TAAAAAGAGGAGAT T GGGACAT AT TAAT CC 

n TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
:f AAGCTGAGCGAGATGGACAGCGTGCTGAT GAAGATCCACACCT CACTGCAGTCGAAGGAGAGAGA 

W GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
IP GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
|lj CCCGGTGATATCCGACATCCAAAGCACGTCCAATAGACGGATGTGGCTGCGACACTGGCGATAGC 
i'p ACTTGGCTTACCGATTCCAAAAGACAGTGTAGGGAGCCTCCTATTCCCAGTTGTGGAAGGAAGAC 
f I CAAT GAGAGAGCAGT T GAGAT T T TT AC AT T T GAAT ACAGTGCAGCT T AGTAAAC T GT T GCAAGAG 

l^t AATG T GC C GT CAT AT GAAAAAGATCC TGGGTT T GAGCAGT T TAAAAT G TCAGAAAGAT TGCAT GG 

PJ GAACTGGATCAGACTGTACTTGGAGGAAAAGCATTCAGAAGTCCTATTCAACCTGGGCTCCAAGG 
S TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
PI TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
Tn CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
jfjj CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
O GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
Rl AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
*fi TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
J" CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 
lU T C GGAC AG C C T C C C AG C AGAG G T G T G G GAG C T GCAG C T GAGGGAAGAAGAGACAAT C GGCC T GG A 

CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
ACTCAGATCCACAGAGCCCAGGATCAAGGGACCCACTGCAGTGGCAGCAGGACTGTTGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
C T GACCGAGACAC TCACAGC TT T GT CATCAGGGCACAGGC T TCCT CGGAGCCAGGAT GATC T GT G 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
C TGCACACAGTAT GTAGT TACCAAAAGAATAAACGGCAATAAT T GAGAAAAAAAA 
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FIGURE 14Q 

MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPTOSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SBCWIVLIDALRDDFVFGSKGWMPYTTYLVEKGASHSFVAEAKPPTVTMPRIKALMTGSLPGF 
VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMSETGSHGAS S TEE VNT PL I LI S S AFERKPGD IRHPKHVQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain: 

amino acids 58-7 6 

N-glycosylation sites. 

amino acids 56-60, 194-198 

N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

Amidation site. 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 
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FIGURE 141 

GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
TCCCTATAGAAAACAACTGCCAGCACCTTAAGACCACTCACACCTTCAGAGTGAAGAACTTAAAC 
C CGAAGAAAT TCAGCAT T CATGAC CAGGAT CACAAAG TAC T GGT CCT GG AC T C T GGGAAT C T CAT 
AGCAGT T CCAGATAAAAACTACAT ACGCC CAGAGATC T T C T TT GCAT TAGC CTCAT C CT TGAGC T 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
TGT GACAAGGATAAAGGACAAAGTCAT C CATC CC T T CAGC TGAAGAAGGAGAAACTGATGAAGCT 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGC71ATTGTAATGAGCCT 

GT T GGGGTGACAGATAAAT T T GAGAACAG GAAAC AC AT T GAAT T T T CAT T T CAACCAGT T TGCAA 
AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 

AT T T GAAC T AAT T G TATAAAAACAC C AAAC C T GC T C AC T 
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FIGURE 142 

MLLLLLEYNFPIENNCQHLKTTHTFRVKNLNPKKFSIHDQDHKVLVLDSGNLIAVPDKNYIRPEI 
FFALASSLSSASAEKGSPILLGVSKGEFCLYCDKDKGQSHPSLQLKKEKLMKLAAQKESARRPFI 
FYRAQVGSWNMLESAAHPGWFI CTSCNCNE PVGVTDKFENRKH IE FS FQPVCKAEMS PSEVS D 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-myristoylation site. 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 
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FIGURE 143 

C T AGAGAG TATAGGGCAGAAGGATGGCAGAT GAGT GAC TCCAC AT CCAGAGC T GC C T CCC T T TAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGT GGAGT T T GCTGTCCACACAT T CAACCAACAGAGCAAGGAC TAC TAT GCCTAC AGAC TGGGG 
CACAT C T T GAAT TCCT GGAAGGAGCAGGT GGAGTCCAAGACTGT AT T CT CAATGGAGCT AC T GC T 
GGGGAGAACT AGGT GT GGGAAAT TT GAAGACGACAT T GACAACT GCCAT T T CCAAGAAAGCACAG 
AGCTGAACAATACTTTCACCTGCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC 
CTCCTGAACAAGACCTGCTTGGAGGGATTCCACTGAGTGAAACCCACTCACAGGCTTGTCCATGT 
GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
ACACAT C T T GAGCC T AAT CATGTAGT G TAGATC AT T AAAC AT CAGCAT T T T AAGAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 144 

MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQRDCDEHNVMARYLPATVEFAVHTFNQQSKDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site. 

amino acids 9-15 
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FIGURE 145 

CTGTGCAGCT CGAGGCTCCAGAGGCACACTCCAGAGAGAGCCAAGGT TCTGACGCGATGAGGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
AAGC T T GCCAGGAGGCTGGCAGTACAGAG C GCAGC AGCGAGCAAATC C T GGCAAGTGACCCAGC T 
CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 
GAGAAGAACACATCAGGCACTGCGCCACCTGCTTCACAGTACTTCCCAACAACTCTTAGAGGTAG 
GTGTATTCCCGT TT TACAGATAAGGAAACT GAGGCCCAGAGAGC TGAAGTACTGCACCCAGCATC 
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 
GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 
CT C T AAT GAAATTGTGAAAGCT CCAT G T T T AGAAATAAAT GAAAACACCT GA 
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FIGURE 146 



MRKHLSWWWLATVCMLLFSHLSAVQTRGIKHRIKWNRKALPSTAQITEAQVAENRPGAFIKQGRK 
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VLWRLVQE LC S LKHCE FWLERGAGLRVTMHQP VLLC LLAL I WLMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 157-171 

N-glycosylation sites. 

amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site. 

amino acids 7 6-83 

N-myristoylation sites. 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 
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FIGURE 147 

GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTCATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAAC T T T C AGAT T CAGGGGGTACAT GTGAAGGTT T GT T T TAT GAGT AT ATt'GCATGATGCT GAGG 
TTTGGGGT 
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FIGURE 148 

MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 



h k N-myristoylation site. 

% * iS amino acids 62-68 

o 

81 
W 
If! 
W 

ry 



13 

ill 
Q 

ru 
o 
ru 
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FIGURE 14Q 

GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
• TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGC CCAGCG CC GG GCT GTGACCCAAGCCGAGC GTGGAAGAATGGGGT T 
CCT CGGGACCGGCACT T GGATTC TGGT GT TAGT GCT CCCGATT CAAGCTTT CCCCAAACCT GGAGGAA 
GCCAAGACAAATCT CTACAT AATAGAGAAT TAAGTGCAGAAAGACCTTT GAATGAACAGAT TGCT GAA 
GCAGAAGAAGAC AAG AT T AAAAAAAC AT AT CCT CC AGAAAACAAG C C AGGT C AGAGCAAC TAT T C T T T 
TGTTGATAACTTGAACCTGCTAAAGGCAATAACAGAAAAGGAAAAAATTGAGAAAGAAAGACAATCTA 
Q T AAGAAGC T C C C C AC T T GAT AAT AAGT T G AAT G T GG AAG AT GT T GAT T C AAC C AAGAAT C GAAAAC T G 

0 ATC GATGATTATGACT C TACTAAGAGT GGAT T GGAT CATAAAT TTCAAGATGATCCAGAT GGT CT TCA 
fl TCAACTAGACGGGACT CCT TTAACCGC TGAAGACAT T GTC CATAAAAT CGCT GCCAGGATT TATGAAG 
^ AAAAT GACAGAGCCGTGTTT GACAAGATT GTT T CT AAACTACT TAATCTCGGCCTTAT CACAGAAAGC 
TA CAAGCACATACACTGGAAGATGAAGTAGCAGAGGTTTTACAAAAATTAATCTCAAAGGAAGCCAACAA 

m 

fll TTATGAGGAGGAT CCCAATAAGCC CACAAGCT GGACTGAGAATCAGGCT GGAAAAATACCAGAGAAAG 

1 T GACT CCAATGGCAGCAAT T C AAG AT GGTCTTGC T AAGG GAGAAAAC GAT GAAAC AG TAT CTAACACA 
CI TTAAC CTT GACAAAT GGCTT GGAAAGGAGAACTAAAACCTACAGT GAAGACAACT T T GAGGAACTCCA 

m 

AT AT T T C C C AAAT T T C TAT G C GC T AC T GAAAAG TAT T GAT T C AG AAAAAGAAGC AAAAGAGAAAGAAA. 
.J*; CACT GAT T ACTAT CATGAAAACAC TGATT GACT T TGT GAAGAT GATGGT GAAATATGGAACAATATCT 

III C CAGAAGAAGGT GT T T C CT AC CT T GAAAACT T G GAT GAAAT GAT TGC T C T TCAGAC CAAAAACAAG C T 

fl I AGAAAAAAATGCTACT GACAATATAAG CAAGCT TTT C CCAG CACCAT CAGAGAAGAGTCATGAAGAAA 

CAGACAGTACCAAGGAAGAAGCAGCTAAGAT GGAAAAGGAATAT GGAAGCTT GAAGGATT CCACAAAA 
GAT GATAACTCCAACCCAGGAGGAAAGACAGATGAAC CCAAAGGAAAAACAGAAGCCTAT TTGGAAGC 
CAT C AGAAAAAAT AT T G AAT G G T T GAAGAAAC AT G AC AAAAAG G G AAAT AAAG AAG AT TAT GAC C T T T 
CAAAGAT GAGAGACT TCATCAATAAACAAGCTGATGCTTAT GT GGAGAAAGGCAT CCTTGACAAGGAA 
GAAGCCGAGGCCATCAAGCGCATTTATAGCAGCCTG TAA AAATGGCAAAAGATCCAGGAGTCTTTCAA 
C TGTT TCAGAAAACATAATATAGCTTAAAACAC TTCTAAT T CTGTGATTAAAATTT T TT GACCCAAGG 
GT T AT TAGAAAG T GC T GAAT T TACAGT AG T T AACCT T TT ACAAG T GGTTAAAACATAGC T T T C T T CCC 
GTAAAAACTATCTGAAAGTAAAGT TGTAT GTAAGCT GAAAAAAAAAAAAAAAAAAAA 
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FIGURE iso 



MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYS FVDNLNLLKAI TEKEKI EKERQS IRS S PLDNKLNVEDVDSTKNRKLI DDYDS TKSGLDHK 
FQDDPDGLHQLDGT PLTAEDI VHKI AARI YEENDRAVFDKI VS KLLNLGL I TE S QAHTLEDEVAE 
VLQKLI SKEANNYEEDPNKPTSWTENQAGKI PEKVT PMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKTYSEDNFEELQYFPNFYALLKSIDSEKEAKEKETLITIMKTLIDFVKMMVKYGTISPEEGV 
SYLENLDEMIALQTKNKLEPCNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 



220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 




N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 



EEAEAIKRIYSSL 



ill 



Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 



N-myristoylation sites: 

amino acids 143-148, 239-244 
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FIGURE 151 



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
G ATG GTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAATAACCAGCT TCTAGCTGGAGGGCTGCATGCAGGGAAGGTCAT TAAAGGTGAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGC T C TATC T T GGT GCCAAGGAAT CCAAGAGC T T CAC CT TC TAC CGGCGGGACATGGGGC TCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGAC TAG GGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
u> TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
*™ CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
O CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
Q TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
|fl GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA 
U\ GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
^ TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
■M I GAAAT GGC T CGAGC TCAGAAGATAAAAGATAAGTAGGG TAT GCT GAT C CT C T T T TAAAAACCCAA 

tij GATACAATCAAAATCCCAGATGCTGGTCTCTATTCCCATGAAAAAGTGCTCATGACATATTGAGA 
flj AGAC CT AC T T ACAAAGTGGCATATAT T GCAAT T TAT T T TAAT T AAAAGAT ACCTAT TTATATAT T 

^ T CT T TATAGAAAAAAGTCTGGAAGAGT TTACTTCAAT TGTAGCAATGT CAGGGTGGTGGCAGTAT 

^ AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 

AT TCCT TGT ATAAAAATAAGAAAAGAAAT TAAT C TT GAGGTAAGCAGAGCAGACAT CATCT C TGA 
W TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
Q TTGTAGTAGTGATCAGGAAACAGATCTCAGCAAAGCCACTGAGGAGGAGGCTGTGCTGAGTTTGT 
m GT GGC T G GAAT C TCTGGGTAAGGAAC T T AAAGAACAAAAAT CAT C T GGT AAT TC T T TCC TAGAAG 

l~l GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA 

AT TGTGTCCCCCT CAAAT TCACATCC T TCT TGGAAT C T CAG TCT GTGAGT T TAT T T GGAGATAAG 
III GTCTCTGCAGATGTAGTTAGTTAAGACAAGGTCATGCTGGATGAAGGTAGACCTAAATTCAATAT 

GACTGGTT TCCT TGTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGACTATGTA 
AAGAT GAAGGCAGAGATCGGAGT T TT GCAGCCACAAGC TAAGAAACACCAAGGAT T GTGGCAACC 
AT CAGAAGCT TGGAAGAGGCAAAGAAGAAT T CTTCCC TAGAGGC T T TAGAGGGATAACGGCTCTG 
CT GAAACCT TAAT CT CAGAC TT C CAGCCT CCTGAACGAAGAAAGAATAAAT T TCGGCTG T T T TAA 
GC CACCAAGGATAAT TGGT TACAGCAGCT CTAGGAAAC TAATACAGCT GC TAAAAT GATCCC TGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATATGGCAGAAGTGATGGCATGCCACT TCCAAGAT TAGGT TATAAAAGACAC TGCAGC T TC 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT 
GCCCACAGCCACATTAGTGAACCTAGAAGCAGAGAC TCTGTGAGATAATCGATGT TTGT TGTTTT 
AAGTTGCTCAGTTTTGGTCTAACTTGTTATGCAGCAATAGATAAATAATATGCAGAGAAAGAG 
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FIGURE 1*2 

MVLSGALCFRMKDSALKVLYLHNNQLLAGGLHAGKVIKGEE I SWPNRWLDASLS PVILGVQGGS 
QCLSCGVGQEPTLTLEPWIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 

N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 



Interleukin-1 signature. 

!:f amino acids 111-131 



y| Interleukin-1 proteins. 

If! amino acids 8-29, 83-120, 95-134, 64-103 

hi . 

ill 
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FIGURE 153 



CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGCA ATG GCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TATTGAAGGTGATGACCTGCATATCCAGAGGAATGTGCAAAAGCTGAAGGACACAGTGAAAAAGC 
T TGGAGAGAGT GGAGAGAT CAAAGCAAT T GGAGAAC T GGAT TT GCTGT TT AT G T C T C T GAGAAAT 
GCCTGCATTTGACCAGAGCAAAGCTGAAAAATGAATAACTAACCCCCTTTCCCTGCTAGAAATAA 
CAAT T AGAT G C CCCAAAGCGAT T TT T T T TAACCAAAAGGAAGAT GGGAAGCCAAACT CCAT CATG 



yl ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGT^AACCAATGCCACTTTTGTTTATA 



AGACCAGAAGGTAGAC T T TCTAAGCATAGATATT TATTGATAACAT TT CAT TGTAAC TGGTGTTC 
T AT AC ACAG AAAAC AAT T TAT T T T T T AAAT AAT TGTCTTTTTC C AT AAAAAAG AT T AC T T T C CAT 
L TCCTTTAGGGGAAAAAACCCCTAAATAGCTTCATGTTTCCATAATCAGTACTTTATATTTATAAA 

w 

f n T G TAT T TAT TAT TAT T AT AAGAC TGCAT T T TAT T TATATCAT T T TAT T AAT AT GGAT T TAT T TAT 

O AGAAACAT CAT T CGATAT TGC TACT T GAG T GTAAGGC TAATAT T GAT AT T T ATGACAATAAT TAT 

Hr AGAGCTATAACATGT TTAT TTGACCT CAATAAACACT TGGATATCCC 

m 
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FIGURE is* 

MAALQKSVSSFmGTLATSCLLLLALLVQGGAAAPISSHCRLDKSNFQQPYITNRTFMLAKEASL 
ADNNTDTOLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEWPFIARLSNRLS 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 

N-myristoylation sites. 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE 155 



GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGATTAGGT GAGGACAGT T C TC T CAT TAGCCTT T TCC TACAGGT GGT TGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 
AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
GTGATGGGCTAGCCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
r « CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 
In CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
O GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
III GCC CAGCACAGGCAC T T TC TAGATAT T T C C CC C T TGCTGGAGAAGAAAGAGCCCC TGGTT TTAT T 

a 



m 

Viz 

ifl 

w 

rii 



ry 



TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
TTGACATGGATGATTCTGAGGAGGAAGCTGTTATTGAATGTATAGAGATTTATCCAAATAAATAT 
CTTTATTTAAAAATGAAAAA 
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FIGURE 156 

MRERPRLGEDSSLISLFLQWAFLAMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

HI N-glycosylation site. 

f|* amino acids 136-140 

III 

iff Tyrosine kinase phosphorylation site. 

amino acids 127-135 

5 3 2 

s 1!? 

3 

f% N-myristoylation sites. 

tfi amino acids 44-50, 150-156 

%t R 

Q 
pi 



APP_ID=1 0063532 



Page 308 of 320 



FIGURE 157 



CCGGCGATGTCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGACT T GAGGGACC TCC GAGTAGAAC CT GT TACAAC TAGTGT T GCAACAGGGGACTAT TCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GAT T T GT GT GACGGGCAAAAGCAAC T TCCAGTCC TACAGC TGT G T GAGGT GCAAT T ACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGC T GAAC ACAGT CTAT T TCATT GGGGCCC AT AATAT TCC TAATGCAAAT AT GAAT GAAGATGG 
C C C T T C CAT G T C T G T GAAT T T C AC C T C AC C AGG C T G C C T AGAC CAC AT AAT GAAATA T AAAAAAA 
AGT GT GT CAAGGCC GGAAGCCT GTGGGATCCGAACAT CACT GC T T G TAAGAAGAATGAGGAGACA 
GTAGAAGT GAAC TTCACAACCACTCCCCT GGGAAACAGATACATGGCTCT TATCCAACACAGCAC 
TATCATCGGGTTTTCTCAGGTGTTTGAGCCACACCAGAAGAAACAAACGCGAGCTTCAGTGGTGA 
TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
CATGGGT GC T GGTGGCAGGGAT CTAT CTAATG T GGAGGCACGAAAGGATC AAGAAGAC TT C C T T T 
TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
TCACACAATTTGTTACTTCACTGAATTTCTTCAAAACCATTGCAGAAGTGAGGTCATCCTTGAAA 
AGTGGCAGAAAAAGAAAATAGCAGAGATGGGTCCAGTGCAGTGGCTTGCCACTCAAAAGAAGGCA 
GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
GTGAT CTAAGAAGCCAGAT TCATCT GCACAAATACG TGGT GGTC TACT T TAGAGAGAT TGATACA 
AAAGACGATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACCTCATGAAGGATGCCACTGCTTT 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE 158 

MSLVLLSLAALCRSAVPREPTVQCGSETGP 

LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 
NANMNEDGPSMSVNFISPGCLDHIMKYKKKCVKA.GSLWDPNITACKKNEETVEVNFTTTPLGNRYMALIQH 
STIIGFSQVFEPHQKKQTRASWIPVTGDSEGATVQLTPYFPTCGSDCIRHKGTWLCPQTGVPFPLDNNK 
SKPGGWLPLLLLSLLVATWLVAGIYIMWRHERIKKTSFSTTTLLPPIECVLVVYPSEICFHHTICYFTEFL 
QNHCRSEVILEKWQKKKIAEMGPVQWIATQKKAADKWFLLSNDVISiSVCDGTCGKSEGSPSENSQDLFPLA 
FNLFCSDLRSQIHLHKYVWYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 
GCCSL 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 

N-gl ycosyl ation si tes . 

amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 
- 287 

cAMP- and cGMP -dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites . 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 
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FIGURE 15Q 

AGCCACCAGCGCAACATGACAGTGAAGACCCTGCATGGCCCAGCCATGGTCAAGTACTTGCTGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 

N ! GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC 

S| ACTCAGCTGAAGAAG 



ff| 
HI 

.a:;!3. 

m 

;;■ ! ;;;r 
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FIGURE 160 

MTVKTLHGPAMVKYLLLSILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQCRNLGCINAQGKEDISMNSVPIQQETLW 
RRKHQGCSVS FQLEKVLVTVGCTCVTPVIHHVQ 

Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 

N-myristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 



ACAC TGGCCAAACAAAAACGAAAGCACT CC GTGC TGGAAGTAGGAGGAGAGT CAGGACTCC CAGG 
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAG ATG CCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCCTGAAGATGAGGAAAAGTTTGGAGGAGCAGCTGACTCAGGGGTGGAGGAGCCTAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
iy TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 

TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
Q ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
01 GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 

CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
fl CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
UJ GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
f|| TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 

AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
;*f GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
If! CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
P CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
fjjj CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 

CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
C GGGAC T TAAATAAAGGCAGAC GC TGTTTTTC T AAAAAAA 
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FIGURE 162 



MPVPWFLLSLALGRSPWLSLEEILVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQE^ETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQWLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSVVYDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDNVHLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHEOSfLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPMDKYIHKRWALWLACLLFAAALSLILLLKKDHAKGWLRLLKQDVRSGAAARG 
RAALLL YSADDS G FERLVGALASALCQIaPLRVAVDLWS RRELSAQG P VAW FH AQ RRQTLQEGGWVLL FS P 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 



f 3 1 Signal sequence : 

Iff amino acids 1-20 

1.1,1 

:U~ Transmembrane domain. 

s amino acids 453-475 



N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-586 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 



GGGAGGGCTCTGTGCCAGCCCCG ATGA GGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAACAT CC TGACGTGGGAC AGCGGGCCAGAGGGCACCC CAGACACGG T CT ACAGCATCGAGTATA 
AGAC GTAC GGAGAGAGGGACT GGGT GGCAAAGAAGGGC TGTCAGCGGATCACCC GGAAGT CC TGC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
GTCAGTGCGGGAGGCCGGTCAGCCACCAAGATGACTGACAGGTTCAGCTCTCTGCAGCACACTAC 
CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 
TTCTACCACT TAGAGCTCCAGGTCAACCGCACC TACCAAATGCACCT TGGAGGGAAGCAGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
fa CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 
h % TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
^ CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 
U TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 
IP AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 
1|| TCCACAGCGGCATAGCCTGTCCGAGATCACCTACTTAGGGCAGCCAGACATCTCCATCCTCCAGC 
Ifl CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 
GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 
W GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 
ftf CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 
s CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 
f*% T TCT CT GCAGGAGGTGACCTCC T T GGCT AT GGAGGAAT CCC AAGAAGCAAAAT CAT T GC AC CAGC 

^ CCCT GGGGAT T TGCACAGACAGAACAT CT GACCCAAAT GT GCTACACAGT GGGGAGGAAGGGACA 

CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 
CI CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 
f|| TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 
Ul GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 
AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCAGATCCTTGGCTGTCA 
111 ATCCCATGCCTGCCCATGCCAC^C^LCTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 

AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 
AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAATGACAGTGCAAGGAGGAAAT GCAGGGAAAC TCCCGAGGT CCAGAGCCCCACC T C 
CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
ACAAT C TAGC T C GACAGAGC AT GAG GC CCC T GC CTCTTCTGT CAT T G T T CAAAGG T GG GAAGAGA 
GCCTGGAAAAGAACCAGGCCTGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACCTGC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
TTCCCAGCCAGGGCAACTGCCTGACGTTGCACGATTTCAGCTTCATTCCTCTGATAGAACAAAGC 
GAAATGCAGGT CCACCAGGGAGGGAGACACACAAGCC T TT TCT GCAGGCAGGAGT TTCAGACCCT 
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 
CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 
ATCTGTAATGTGGGGATCATAACACCTACCTCATGGAGTTGTGGTGAAGATGAAATGAAGTCATG 
TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 
AAAAAAAA 
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FIGURE 164 

MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites. 

amino acids 80-83, 87-90, 172-175 

N-myristoylation sites . 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE i6s 



TGGCCTACTGGAAAAAAAAAAAAAAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACAAC ATG G 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAG GT GACGAT G TATAT GT CTACTACAAAC TGGCAGGGGGATCCCT T GAAC TT T GGGC TGGA 
AGTGTT GAACACAGT TT TGGATAT TT TCCAAAAGAT TTGATCAAGGTACT TCATAAATACACGGA 
AGAAGAGCTACATAT TCCAGCAGATGAGACAGACTT TGTCTGCT TT GAAGGAGGAAGAGATGAT T 
T T AATAG T T ATAAT GTAGAAGAGCT T T TAGGAT CT T TGGAACT GGAGGAC TCTGTACCTGAAGAG 
III TCGAAGAAAGCTGAAGAAGTTTCTCAGCACAGAGAGAAATCTCCTGAGGAGTCTCGGGGGCGTGA 
111 AC T T GAC CCT GTGCC TGAGCCCGAGGCAT TCAGAGC T GAT TCAGAGGAT GGAGAAGGT GC T TTC T 

||| C AGAGAGCACCGAGGGGC TGCAGGGACAGCCC TCAGCTCAGGAGAGC CAC CCT CACACCAGCGGT 

lit CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 

5 a 

I!! 1 ! ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 

rs s 

AGACAGATGCTTACAAAGTCCTGAAAACAGAAATGAGTCAGAGAGGAAGT GGACAGTGCGT TATT 

p CATTACAGCAAAGGATTTCGTT GGCATCAAAATCTAAGTT TGTT TTACAAAGAT TGTT TT TAGTA 

III CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 

O AAAAAAAAAAAAAAAAAAAA 

Til 
O 

5> ;•: s ' 
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FIGURE 166 



MAAAPGLLFWLFVLGALWOTPGQSDLSHGRRFSDL^ 

FECKGDDVYVYYKLAGGSLELWAGSVEHS FGYFPKDL IKVLHKYTEEELHI PADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site. 

amino acids 294-298 

cAMP- and cQvlP-dependent protein kinase phosphorylation site. 

amino acids 30-34 

Tyrosine kinase phosphorylation site. 

amino acids 67-7 6 

N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 

Amidation site. 

amino acids 28-32 
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FIGURE 167 



W 



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCT TGAGAAT GAAGCAGAACAGAAGTTAATAGAGAGCATGT T GCAAAACC T GACAAAACCCGG G 
ACAGGGAT T T C TGAT GGT GATT TC TGGATAGGGCTT TGGAGGAATGGAGAT GGGCAAACATC TG G 
TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
CTTGGGGGTCCCTACCTTTACCAGTGGAATGATGACAGGTGTAACATGAAGCACAATTATATTTG 
CAAGT AT GAACCAGAGAT TAAT CCAACAGCCCC TGTAGAAAAGCCT T AT CT TACAAAT CAACCAG 
GAGACACCCAT CAGAATGTGGT TGT TACT GAAGCAGGTATAAT T CCCAATC TAAT T TATGT TGT T 
ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
GCATAAAAGTAAAGGAAGAACAAAAACTAGTCCAAACCAGTCTACACT GTGGAT TTCAAAGAGTA 
CCAGAAAAGAAAGTGGCATGGAAGT ATAA TAAC TCAT TGACTT GGT TCCAGAAT TT TGTAATTCT 
fU GGATCTGTATAAGGAATGGCATCAGAACAATAGCTTGGAATGGCTTGAAATCACAAAGGATCTGC 
a AAGAT GAAC T G T AAGC T CCCCC T T GAGGCAAAT AT TAAAG TAAT T T T T AT AT GT C T AT TAT T T C A 

f J T T TAAAG AAT AT GC T GT GC T AAT AAT GGAG T GAGACAT G C T TAT T T T G C TAAAG GAT G C AC C CAA 

JJJ ACTT CAAACTTCAAGCAAATGAAATGGACAATGCAGATAAAGTT GT TATCAACACGT CGGGAGTA 

Jj : TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
W TGTATAT TGTATTGAAAT TTACAGTGTGCAAAAGTATTTTACCT TTGCATAAGTGTT TGATAAAA 

flf ATGAACT GT TC TAATAT T TATT TT TATGGCATCTCATT T T TCAATACATGC TCT T TT GAT TAAAG 

PI AAACT TATTAC TGT TGTCAACT GAAT TCACACACACACAAAT AT AGTACCATAGAAAAAGTT TGT 

TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
AAT AAGAAGCTATT TCAT TAAGTGTGATATAAACCTCCTCAAACATT TTACT TAGAGGCAAGGAT 
TGTCTAATTTCAATTGTGCAAGACATGTGCCTTATAATTATTTTTAGCTTAAAATTAAACAGATT 
T TGTAATAAT GTAAC T T T G TTAATAGGT GCATAAACAC TAATGCAGT CAAT T TGAACAAAAGAAG 
TGACATACACAATATAAATCATATGTCTTCACACGT TGCCTATATAATGAGAAGCAGCTCTC TGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG 
AC TAT AT T AGTATACAAAGAGGT CATGTGGTT GAGACCAGGTGAATAGTCAC TAT CAGT GTGGAG 
ACAAGCACAGCACACAGACATTT TAGGAAGGAAAGGAAC TACGAAATCGTGTGAAAATGGGTTGG 
AACCCAT CAGTGAT CGCATAT TCAT TGAT GAGGGT T TGC T TGAGATAGAAAATGGTGGC TCCT T T 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



10 
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FIGURE 168 



MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACESE 
GGVLLSLENEAEQKL I E SMLQNLTKPGTG I S.DGDFW I GLWRNGDGQT SGACPDL YQWSDGSNSQ 
YRNWYTDEPSCGSEKCWMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK 
PYLTNQPGDTHQNVWTEAGIIPNLIYWIPTIPLLLLILVAFGTCCFQMLHKSKGRTKTSPNQ 
STLWISKSTRKESGMEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 

N-glycosylation sites. 

amino acids 86-89, 255-258 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 266-269 

N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 
145, 212-217 
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